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To Arianna, Susanna and Chiara, Alessandro, Cecilia 


Preface 


The purpose of this textbook is to present an array of topics that are found in 
the syllabus of the typical second lecture course in Calculus, as offered in many 
universities. Conceptually, it follows our previous book Mathematical Analysis I, 
published by Springer, which will be referred to throughout as Vol. I. 

While the subject matter known as ‘Calculus 1’ concerns real functions of real 
variables, and as such is more or less standard, the choices for a course on ‘Calculus 
2’ can vary a lot, and even the way the topics can be taught is not so rigid. Due 
to this larger flexibility we tried to cover a wide range of subjects, reflected in 
the fact that the amount of content gathered here may not be comparable to the 
number of credits conferred to a second Calculus course by the current programme 
specifications. The reminders disseminated in the text render the sections more 
independent from one another, allowing the reader to jump back and forth, and 
thus enhancing the book’s versatility. 

The succession of chapters is what we believe to be the most natural. With 
the first three chapters we conclude the study of one-variable functions, begun in 
Vol. I, by discussing sequences and series of functions, including power series and 
Fourier series. Then we pass to examine multivariable and vector-valued functions, 
investigating continuity properties and developing the corresponding integral and 
differential calculus (over open measurable sets of R” first, then on curves and 
surfaces). In the final part of the book we apply some of the theory learnt to the 
study of systems of ordinary differential equations. 

Continuing along the same strand of thought of Vol. I, we wanted the present- 
ation to be as clear and comprehensible as possible. Every page of the book con- 
centrates on very few essential notions, most of the time just one, in order to 
keep the reader focused. For theorems’ statements, we chose the form that hastens 
an immediate understanding and guarantees readability at the same time. Hence, 
they are as a rule followed by several examples and pictures; the same is true for 
the techniques of computation. 

The large number of exercises, gathered according to the main topics at the 
end of each chapter, should help the student test his improvements. We provide 
the solution to all exercises, and very often the procedure for solving is outlined. 
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Some graphical conventions are adopted: definitions are displayed over grey 
backgrounds, while statements appear on blue; examples are marked with a blue 
vertical bar at the side; exercises with solutions are boxed (e.g., | 12. |). 

This second edition is enriched by two appendices, devoted to differential and 
integral calculus, respectively. Therein, the interested reader may find the rigorous 
explanation of many results that are merely stated without proof in the previous 
chapters, together with useful additional material. We completely omitted the 
proofs whose technical aspects prevail over the fundamental notions and ideas. 
These may be found in other, more detailed, texts, some of which are explicitly 
suggested to deepen relevant topics. 

All figures were created with MATLAB™ and edited using the freely-available 
package psfrag. 

This volume originates from a textbook written in Italian, itself an expanded 
version of the lecture courses on Calculus we have taught over the years at the 
Politecnico di Torino. We owe much to many authors who wrote books on the 
subject: A. Bacciotti and F. Ricci, C. Pagani and S. Salsa, G. Gilardi to name a 
few. We have also found enduring inspiration in the Anglo-Saxon-flavoured books 
by T. Apostol and J. Stewart. 

Special thanks are due to Dr. Simon Chiossi, for the careful and effective work 
of translation. 

Finally, we wish to thank Francesca Bonadei — Executive Editor, Mathem- 
atics and Statistics, Springer Italia — for her encouragement and support in the 
preparation of this textbook. 


Torino, August 2014 Claudio Canuto, Anita Tabacco 
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Numerical series 


This is the first of three chapters dedicated to series. A series formalises the idea of 
adding infinitely many terms of a sequence which can involve numbers (numerical 
series) or functions (series of functions). Using series we can represent an irrational 
number by the sum of an infinite sequence of increasingly smaller rational numbers, 
for instance, or a continuous map by a sum of infinitely many piecewise-constant 
functions defined over intervals of decreasing size. Since the definition itself of 
series relies on the notion of limit of a sequence, the study of a series’ behaviour 
requires all the instruments used for such limits. 

In this chapter we will consider numerical series: beside their unquestionable 
theoretical importance, they serve as a warm-up for the ensuing study of series 
of functions. We begin by recalling their main properties. Subsequently we will 
consider the various types of convergence conditions of a numerical sequence and 
identify important classes of series, to study which the appropriate tools will be 
provided. 


1.1 Round-up on sequences 


We briefly recall here the definition and main properties of sequences, whose full 
treatise is present in Vol. I. 

A real sequence is a function from N to R whose domain contains a set 
{n €N : n> no} for some integer no > 0. If one calls a the sequence, it is 
common practice to denote the image of n by a, instead of a(n); in other words 
we will write a: n +> a,. A standardised way to indicate a sequence is {an }n>no 
(ignoring the terms with n < no), or even more concisely {a}. 

The behaviour of a sequence as n tends to oo can be classified as follows. The 
sequence {dn}n>n. converges (to ¢) if the limit Jim, dy, = & exists and is finite. 
When the limit exists but is infinite, the sequence is said to diverge to +00 or 
—oo. If the sequence is neither convergent nor divergent, i.e., if im Gn does not 


exist, the sequence is indeterminate. 
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The fact that the behaviour of the first terms is completely irrelevant justifies 
the following definition. A sequence {an}n>n, satisfies a certain property even- 
tually, if there is an integer N > no such that the sequence {a,,}n>n satisfies the 
property. 

The main theorems governing the limit behaviour of sequences are recalled 
below. 


Theorems on sequences 


. Uniqueness of the limit: if the limit of a sequence exists, it is unique. 

. Boundedness: a converging sequence is bounded. 

. Existence of the limit for monotone sequences: an eventually-monotone 
sequence is convergent if bounded, divergent if unbounded (divergent to 
+oo if increasing, to —oo if decreasing). 

. First Comparison Theorem: let {an}, {b,} be sequences with finite or 


infinite limits lim a, = @ and lim b, = m. If ay < by, eventually, then 
l Z NCO nN—- oo 
ir 


. Second Comparison Theorem - finite case: let {an}, {bn} and {c,} be 


sequences with lim a, = lim c, = €E€ R. If an < bn < cy eventually, 
noo VAD 


then lim 0, = £. 
Noo 
. Second Comparison Theorem - infinite case: let {an}, {bn} be sequences 
such that im Gn = +00. If an < by eventually, then im bn = too. A 
similar result iholde if the limit is —oo: lim b, = —oo ilies im CC —— 
N—->co 
—0o. 
. Property: a sequence {a,} is infinitesimal, that is dim a, = 0, if and only, 
if the sequence of absolute values {|a,,|} is ome 
. Theorem: if {ap} is infinitesimal and {b, } bounded, {anbp } is ee 
. Algebra of limits: let {an}, {bn} be such that im G, = and lim b, 


Noo 
(¢, m finite or infinite). Then 


lin (a, = 0; )—t =m, 
n—- oo 


lim a, 0, —£ m1, 
noo 
if b, ~ 0 eventually, 


whenever the right-hand sides are defined. 
. Substitution Theorem: let {an} be a sequence with lim a, = @ and sup- 
n—->co 


pose g is a function defined on a neighbourhood of ¢: 
a) if 2€ R and g is continuous at @, then lim Glan) = ge) 


b) if 2¢ R and lim, g(x) = m exists, then lim jG) — mn, 
Bird N+ CO 
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10. Ratio Test: let {a,} be a sequence for which a,, > 0 eventually, and suppose 
the limit 


; An+1 
lim —2 


N—-+0O An 
exists, finite or infinite. If g<1then lim a, =0;ifg>1then lim a, = 
se nNn—- co noo 
Cor 


Let us review some cases of particular relevance. 


Examples 1.1 


i) Consider the geometric sequence a, = q", where q is a fixed number in R. In 
Vol. I, Example 5.18, we proved 


does not exist 


ii) Let p > 0 be a given number and consider the sequence y/p. Using the 
Substitution Theorem with g(x) = p” we have 


lim Y/p= lim pil” =p =1. 


N—->oCo N+ Co 


iii) Consider now (/n; again using the Substitution Theorem, 


| 
lm W%/n= lim exp ~e~ =e? =1. 
noo n—oo n 


1\7 
iv) The number e may be defined as the limit of the sequence ay, = (1 + | . 


which converges since it is strictly increasing and bounded from above. 


v) At last, look at the sequences, all tending to +00, 


(one ange ul). 


In Vol. I, Sect. 5.4, we proved that each of these is infinite of order bigger than 
the preceding one. This means that for n > oo, with a > 0 and q > 1, we have 


logn = o(n®), na” = og"). 


mr 


¢g” =o(n!), nl=o(n™). 


4 1 Numerical series 


1.2 Numerical series 


To introduce the notion of numerical series, i.e., of “sum of infinitely many num- 
bers”, we examine a simple yet instructive situation borrowed from geometry. 

Consider a segment of length ¢ = 2 (Fig. 1.1). The middle point splits it into two 
parts of length ag = @/2 = 1. While keeping the left half fixed, we subdivide the 
right one in two parts of length a; = ¢/4 = 1/2. Iterating the process indefinitely 
one can think of the initial segment as the union of infinitely many segments of 
lengths 1, $, 4, 3, 74,--- Correspondingly, the total length of the starting segment 
can be thought of as sum of the lengths of all sub-segments, in other words 


a or | 1 
2=1+-+-+-4+-—-+... 1.2 
T5916 ee) 


On the right we have a sum of infinitely many terms. This infinite sum can be 
defined properly using sequences, and leads to the notion of a numerical series. 


Given the sequence {ax }k>0; one constructs the so-called sequence of partial 
sums {s,,}n>0 in the following manner: 


59 = a0, Ss; =ant+a1, $2 =ag + a1 +42, 


and in general, 


Note that s, = 5,_1 +@n. Then it is natural to study the limit behaviour of such 
a sequence. Let us (formally) define 


The symbol S- ax is called (numerical) series, and a, is the general term of 
k=0 
the series. 


Figure 1.1. Successive splittings of the interval [0, 2]. The coordinates of the subdivision 
points are indicated below the blue line, while the lengths of sub-intervals lie above it 
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n 
Definition 1.2 Given the sequence {ax}x>0 and sp = So ax, consider the 
k=0 
limit lim Sp. 
N—- oo 
i) If the limit exists (finite or infinite), its value s is called sum of the 
series and one writes 
[o-@) 
Sus Se= lim Sp. 
noo 
k=0 


CO 
- If s is finite, one says that the series So ax converges. 
k=0 


[e.) 
- Ifs is infinite, the series So ak diverges, to either +00 or —oo. 
k=0 


CO 
ii) If the limit does not exist, the series yS ay is indeterminate. 
k=0 


Examples 1.3 


i) Let us go back to the interval split infinitely many times. The length of the 
shortest segment obtained after k + 1 subdivisions is a, = x k > 0. Thus, we 


CO 
iL 
consider the series OK Its partial sums read 


k=0 
1 3 1 1 7 
SQ : 81 + 5 5? 82 vay a 7? 
ae a a2) 
— BET on 


Using the fact that a”*1 — 6"*1 = (a — b)(a" + a” 104+... + ab"-1 + 6"), and 
choosing a = 1 and b = & arbitrary but different from one, we obtain the identity 


lag 
1 or we 1.3 
i rea i (1.3) 
Therefore 
ae 1 1 
,=l _ — ant = _ — ooo ae 
and so 
1 
lim s, = lim (2-5) = 2. 
noo noo Qn 


The series converges and its sum is 2, which eventually justifies formula (1.2). 
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ii) The partial sums of the series S>(-1)* satisfy 


k=0 
Sso=l, 
Ssg=s,t+1l=1, 
Son = 1, 


s3 =1-1=0, 


63 = s9-1=0, 


S2n+1 = 0. 


The terms with even index are all equal to 1, whereas the odd ones are 0. 
Therefore lim s,, cannot exist and the series is indeterminate. 
N— Oo 


iii) The two previous examples are special cases of the following series, called 
geometric series, 


[o) 
k 
eee 
k=0 
where q is a fixed number in R. The geometric series is particularly important. 


Ifqg=1, then s, =ap+a,+...¢a, =14+1+4+...41=n+land lim s, = +o. 
Hence the series diverges to +00. UAT 


If ¢ # 1 instead, (1.3) implies 


_ go 
Sp =lt+qt¢@st...t+q= 
Dg 
Recalling Example 1.1 i), we obtain 
1 
—- if |q| <1, 
; ; f= gut 1- qd 
Paes ag ee ifq>1, 


does not exist ifq<-—1l. 
In conclusion 


converges to 7 


diverges to +00 


is indeterminate 


Sometimes the sequence {a,} is only defined for k > ko: Definition 1.2 then 
modifies in the obvious way. Moreover, the following fact holds, whose easy proof 
is left to the reader. 


Property 1.4 A series’ behaviour does not change by adding, modifying or 


eliminating a finite number of terms. 
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Note that this property, in case of convergence, is saying nothing about the 
sum of the series, which generally changes when the series is altered. For instance, 


Examples 1.5 


— 1 
i) The series >. th—ik is called series of Mengoli. As 
k=2 


it follows that 


1 
a ae 5} 


1 
=1--. 
n 
Thus 
: : 1 
lim s, = lm (1-+) =1 
N— Co noo nr 
and the series converges to 1. 
— 1 
ii) For the series » log (1 + -) we have 
k=1 ‘ 
1 k+1 
ar = log (1 + 7) = log = log(k +1) —logk, 
SO 
s, = log2, so = log2 + (log3 — log2) = log3, 


Sn = log2+ (log3 — log2)+... + (log(n + 1) - log n) = log(n +1). 
Hence 


lim s, = lim log(n+1) = +00 
noo noo 


and the series diverges (to +00). 


The two instances just considered belong to the larger class of telescopic 
series. These are defined by ax, = by41 — bx for a suitable sequence {bg }k>ko- 
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Since Sp = bn+41 — by,, the behaviour of a telescopic series is the same as that of 
the sequence {bx}. 

There is a simple yet useful necessary condition for a numerical series to con- 
verge. 


[o-2) 
Property 1.6 Let 3 ap be a converging series. Then 


k=0 ; 
lim az, = 0. 
k-0o 


Proof. Let s= lim s,. Since ax = sz — Sx_1, 
noo 


lim ay = lim (sz, — $g-1) = 8 —S =0. 
k—- oo k- oo 


Observe that condition (1.4) is not sufficient to guarantee convergence. The 
general term of a series may tend to 0 without the series having to converge. 


—. 1 
For example we saw that S- log (1 + =) diverges (Example 1.5 ii)), while the 
k=1 


il 
continuity of the logarithm implies that jim log (1 + =z) = 0. 
00 


Example 1.7 
It is easy to see that S- (1 — =) does not converge, because the general term 
k=1 


k 
ap = (1 — t) tends to e~! + 0. 


[o.@) 
If a series > dz converges to s, the quantity 
k=0 


is called nth remainder. 
Now comes another necessary condition for convergence. 


[o-2) 
Property 1.8 Let ys az be a convergent series. Then 
k=0 


hin ?, =O. 
nN—-oo 


Proof. Just note lim r, = lim (s—s,)=s—s=0. 
nN—- oo n—- oo 
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[o) 
It is not always possible to predict the behaviour of a series > az using merely 


k= 
the definition. It may well happen that the sequence of partial ies cannot be com- 
puted explicitly, so it becomes important to have other ways to establish whether 
the series converges or not. In case of convergence it could also be necessary to 
determine the actual sum explicitly. This may require using more sophisticated 
techniques, which go beyond the scopes of this text. 


1.3 Series with positive terms 


We deal with series ay for which az, > 0 for any k EN. 
k=0 


CO 


Proposition 1.9 A series Ee a, with positive terms either converges or di- 


k=0 
verges to +00. 


Proof. The sequence s, is monotonically increasing since 
Snt1 = 8n + Qn41 = Sn, Yn > 0. 


It is then sufficient to use Theorem 3 on p. 2 to conclude that lim s,, 
noo 


exists, and is either finite or +00. Oo 


We list a few tools for studying the convergence of positive-term series. 


CO CO 

Theorem 1.10 (Comparison Test) Let ye a, and De by be positive-term 
k=0 k=0 

series such that 0 < az < bx, for any k > 0. 


CO CO 
poate y be converges, also y Gk converges and 
k=0 k=0 6 


Sax < Sob 
k=0 k 


=0 


Vijay. Se ax diverges, then iy by diverges as well. 
k=0 k=0 
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Proof. i) Denote by {s,} and {t,} the sequences of partial sums of > ak, >. by 


k=0 k=0 
respectively. Since ay, < by for all k, 
SS tas Vn > 0 
[o.@) 
By assumption, the series Se converges, so lim t, = t € R. Propos- 
R= So 


k=0 
ition 1.9 implies that lim s, = s exists, finite or infinite. By the First 
noo 


Comparison Theorem (Theorem 4, p. 2) we have 
s= lim s, < lim t, =teER. 
noo n—-oo 


CO 
Therefore s € R, and the series >, dz converges. Furthermore s < t. 
k=0 


co [o.@) 
ii) By contradiction, if So be converged, part i) would force So ar to 


k=0 k=0 
converge too. Oo 
Examples 1.11 
1 
i) Consider S- om As 

k=1 

1 1 

<< : VA a2, 
k?  (k-1)k ~ 
are! 


and the series of Mengoli S- converges (Example 1.5 i)), we conclude 


(k —1)k 
k=2 
that the given series converges to a sum < 2. One can prove the sum is precisely 


~ (see Example 3.18). 


(oe) 


1 
ii) The series > k is known as harmonic series. Since log(1+x) <2,Vx>—-1, 


k=1 
(Vol. I, Ch. 6, Exercise 12), it follows 
1 1 
log (i) = Vek > 1; 
Pe ba) eae 
= 1 
but since S- log (1 + 7) diverges (Example 1.5 ii)), we conclude that the har- 


k=1 
monic series diverges. 


iii) Subsuming the previous two examples we have 


| 

y —, a O> (1.5) 
ko 

k=1 
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called generalised harmonic series. Because 
1 iF ih 1 
aaa for 0<a<l, —< = 
a ns ke ~ #2 
the Comparison Test tells us the generalised harmonic series diverges for 0 < a < 
1 and converges for a > 2. The case 1 < @ < 2 will be examined in Example 1.19. 


for a > 2, 


Here is a useful criterion that generalises the Comparison Test. 


Theorem 1.12 (Asymptotic Comparison Test) Let Ss ap and ye by be 


k=0 k=0 
positive-term series and suppose the sequences {ax}xn>o0 and {by}n>0 have the 
same order of magnitude for k + oo. Then the series have the same behaviour. 


Proof. Having the same order of magnitude for k + oo is equivalent to 


lim “ =¢eR\ {0}. 
k-oo by 
Qk by, 
Therefore the sequences « — and 4 — are both convergent, 
k) K>0 Gk J K>0 


hence both bounded (Theorem 2, p. 2). There must exist constants 
M,, Mz > 0 such that 


b bp 
cK <M, and |—|< Mp 
by Qk 
for any k > 0, i.e., 
|ax| ss My, |bz| and |b;.| <— Mo\ax| : 
Now it suffices to use Theorem 1.10 to finish the proof. Oo 
Examples 1.13 
a . k+38 1 
i) Consider S- ak = S- eB and let b, = . Then 
k=0 k=0 
i zk 1 
ae by 7 2 


and the given series behaves as the harmonic series, hence diverges. 


= ol i al 
ii) Take the series s “n= >. sin oe As sin = ~ = for k — oo, the series has 
k=1 k=1 


k2 
[o.) 
the same behaviour of y oe so it converges. 
k=1 


12 1 Numerical series 


Eventually, here are two results — of algebraic flavour and often easy to employ — 
which prescribe sufficient conditions for a series to converge or diverge. 


CO 
Theorem 1.14 (Ratio Test) Let the series So ax have a, > 0, Vk > 0. 


Assume the limit k=0 


; Qk4+1 
lin Sy 


k-+co Qk 


exists, finite or infinite. If <1 the series converges; if £ > 1 it diverges. 


Proof. First, suppose @ finite. By definition of limit we know that for any « > 0, 
there is an integer kz > 0 such that 


CLL. 
VA>k => at <eé ie, fate elite, 
Qk Qk 
Assume ¢ < 1. a set g = +4, Se) 
je , <lt+e=q, VA > Re 
ak 


Repeating the argument we obtain 


2 k—ke 
Ak+1 < Gap <Q ap-1<.--<q “Qk, 41 


hence Ake+1 
Appi <q, Wk > ke. 


The claim follows by Theorem 1.10 and from the fact that the geometric 
series, with q < 1, converges (Example 1.3). 
Now consider £ > 1. Choose e = ¢— 1, and notice 

jaf=22°™. ok. 

Ak 

Thus ap41 > Gk >... > Ak. +1 > 0, so the necessary condition for conver- 
gence fails, for tim tnt, 
Eventually, if ea = = we put A = 1 in the condition of limit, and there 


exists k4 > 0 with ax > 1, for any k > kag. Once again the necessary 
condition to have convergence does not hold. 


CO 
Theorem 1.15 (Root Test) Given a series So ax wil ae = OU, Vk SU, 
k=0 
ul lian vag — 0 
k—-00 


exists, finite or infinite. If <1 the series converges, if £ > 1 it diverges. 


Proof. This proof is essentially identical to the previous one, so we leave it to the 
reader. O 
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Examples 1.16 


ik k k+1 
i) For d ak We have az = aE and ap41 = men? therefore 
1 1 1 
i ee ee 
k-+co Ak k>00 3 ~ CUdk 3 


The given series converges by the Ratio Test 1.14. 


ae 
ii) The series y ER has 
k=1 


The Root Test 1.15 ensures that the series converges. 


We remark that the Ratio and Root Tests do not allow to conclude anything 


if 2 = 1. For example, > : diverges and » z converges, yet they both satisfy 
Theorems 1.14 and 1.15 with @= 1. 

In certain situations it may be useful to think of the general term a, as the value 
at « =k of a function f defined on the half-line [ko, ++oo). Under the appropriate 
assumptions, we can relate the behaviour of the series to that of the integral of f 
over [kg, +00). In fact, 


Theorem 1.17 (Integral Test) Let f be continuous, positive and decreasing 
on |ko, +00), for ko € N. Then 


oo +oo fore) 
S> fk) < | f(a) de < J f(k). (1.6) 


eae ko =o 


Therefore the integral and the series share the same behaviour: 
ALes love) 

a) f(x)dx converges <=> > f(k) converges; 
k=ko 


b) faida dwerges <=> Se f(k) diverges. 
k=ko 


Proof. Since f decreases, for any k > ko we have 
fR+1)<f@)<sf(k), Vaeelk,k+1], 


and as the integral is monotone, 


k+1 
f(k +1) < | Hated Fh). 
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Then for all n € N with n > ko we obtain 


np n+l n 
S> fk =) f(a)ax < SDF) 
byt = 


(after re-indexing the first series). Passing to the limit for n — +00 and 
recalling f is positive and continuous, we conclude. 0 


From inequalities (1.6) it follows easily that 


dade +e 
[fs Oa<> @ ) < f(ko) ff f(a) dz. 


0 k=ko 


Comparing with the improper integral of f allows to estimate, often accurately, 
the remainder and the sum of the series, and use this to estimate numerically these 
values: 


CO 


Property 1.18 Under the assumptions of Theorem 1.17, if De f(k) 


k=ko 
converges then for all n > ko 


(1.7) 


Proof. If > f(k) converges, (1.6) can we re-written substituting ko with any 
k=ko 
integer n > ko. Using the first inequality, 


+00 
fn = S— Sy = 3 f(k <| flx)de, 
k=n-+1 


while changing kp to n+ 1 in the second one yields 


- dees 


a 


This gives formula (1.7), from which (1.8) follows by adding s, to each 
side. O 


Examples 1.19 
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i) The Integral Test is used to study the generalised harmonic series (1.5) for all 


1 
admissible values of the parameter a. Note in fact that the function — ,a> 0, 
x 


fulfills the hypotheses and its integral over [1, +00) converges if and only if a > 1. 


In conclusion, 


Ss il { converges if a > 1, 


| k® | diverges if0<a<1. 


ii) In order to study 


we take the map f(x) = 


[e-e) 
Consequently, the series Ss" 
k=2 


=~. il 
2. Elogk 


; its integral over [2, +00) diverges, since 


x log x 


+00 1 +00 1 
| f= / — dt = +00. 
2 x log x log 2 t 


is divergent. 


up to the first 10 terms. 


and 


klogk 
iii) Suppose we want to estimate the precision of the sum of S- 3 computed 
k=1 
+00 1 
We need to calculate / f(x) dx with f(x) = =: 
im ie 
ia dy= ii seen 
nw Testo | De? nn 
= < = dc= — 0.005 
T10 = §— §10 & 1 (e *= 3(10)2 — , 
+00 1 
a —; dx = ——~ = 0.004132... 
ro me 21 


The sum may be estimated with the help of (1.8): 


Since 


we find 1.201664 < s < 1.202532. The exact value for s is 1.202057... 


1 1 
—— <s< ————-.. 
“OY 5Ce—° - SG0e 


1 1 
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1.4 Alternating series 


These are series of the form 


CO 


S\(-1)"b, with bp > 0, Vk >0. 


k=0 


For them the following result due to Leibniz holds. 


Theorem 1.20 (Leibniz’s Alternating Series Test) An alternating 


(—1)*b, converges if the following conditions hold 


ii) the sequence {by}n>0 decreases monotonically . 


Denoting by s its sum, for alln > 0 one has 


Pn = Is = S| s Bn41 and pri SSS Sd, - 


Proof. As {b,}x%>0 is a decreasing sequence, 
82n, = 82n—2 — ban—1 + ban = 82n—2 — (ban—1 — ban) < Sen—2 


nh = 
and S544 = Signet + Oey — Canin 2 San—1: 


Thus the subsequence of partial sums made by the terms with even index 
decreases, whereas the subsequence of odd indexes increases. For any n > 


0, moreover, 
San = 8an—1 + ban 2 San-1 2 -~+ 2 $1 


nh _ 
and Soi = Ste — Boye & San Sasa S Cos 


Thus {52n}n>o0 is bounded from below and {s2n41}n>0 from above. By 
Theorem 3 on p. 2 both sequences converge, so let us put 


lim 24, = iit a, =a" and litt 2643.7 = SUP Soy01 = By. 
noo n>0 n—-oco n>0 
Since : F 
s* —s, = lim (son - Son41) = lin bo, ty =O, 
n—-oo V6 Joe aS. 2) 
(oe) 
we conclude that the series y (—1)*b, has sum s = s* = s,. In addition, 
k=0 


Sona  S & Boy,, wmv, 
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in other words the sequence {52n}n>0 approximates s from above, while 
{S2n+1}n>0 approximates s from below. For any n > 0 we have 


D8 = Song & 9nso — Gon = Orns 


and 
DS aig — 8 = Say — Bana = logs; 


so |rn| = |S — Sn| < bn41- Oo 


Examples 1.21 


1 
i) Consider the generalised alternating harmonic series Ds where 
k=1 


a>0.As jim, b, = im = = 0 and the sequence {4 } k>1 is strictly decreasing, 
the series converges. 
ii) Condition i) in Leibniz’s Test is also necessary, whereas ii) is only sufficient. 
In fact, for k > 2 let 
_ fi/k k even, 
SV (k= 1)/k? k odd. 
It is straightforward that b, > 0 and by is infinitesimal. The sequence is not 
monotone decreasing since by > by41 for k even, by < by41 for k odd. Neverthe- 
less, 


2b = ae 7 Ga 
= = k>3 


k odd 
converges, for the two series on the right converge. 


(=1)* 


k} 


[o2) 
iii) We want to approximate the sum of > to the third digit, meaning 
k=0 
with a margin less than 10~*. The series, alternating for b, = a converges by 


Leibniz’s Test. From |s — sp| < bn41 we see that for n = 6 


O<sg—s<by= = 0.0002 < 107°. 


5040 ~ 5000 
As sg = 0.368056..., the estimate s ~ 0.368 is correct up to the third place. 


To study series with arbitrary signs the notion of absolute convergence is useful. 


[o2) 
Definition 1.22 The series So ak converges absolutely if the positive- 


aS k=0 


term series y lax] converges. 
k=0 
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Example 1.23 


(oe) CO 
1 1 
The series y rs converges absolutely because y RB converges. 
k=0 k=0 


The next fact ensures that absolute convergence implies convergence. 


[e-e) 
Theorem 1.24 (Absolute Convergence Test) Jf So aK converges abso- 
k=0 


lutely then it also converges, and 


Proof. The proof is similar to that of the Absolute Convergence Test for improper 


integrals. 
Define sequences 
# ve ifa, > 0 _ ‘° ifa, > 0 
a, = and a, = 
0 ifa,<0 —a, ifa, <0 


Note a .a, = 0 for all & > 0, and 
a, =a, —a, , |a,| =af +a; . 


As0< a a, < |ax|, for any k > 0, the Comparison Test (Theorem 1.10) 


tells us that > c and a a, converge. But for any n > 0, 


nm n mr 
= > a 
San = 5 (a f-a,) = lat - doa, 
[o.@) 
so also the series . an = 3 aj — y a, converges. 


k=0 
Passing now to ‘i ‘iol abs ; —ooin 


n 
iC = >, lax], 
k=0 


we obtain the desired inequality. 


1.5 The algebra of series 19 


Remark 1.25 There are series that converge, but not absolutely. The alternating 
[o) 


1 
harmonic series Ss >(-1)* rs is one such example, for it has a finite sum, but does not 


k=1 
foe) 


converge absolutely, since the harmonic series y i diverges. In such a situation 
k=1 
one speaks about conditional convergence. Oo 


The previous criterion allows one to study alternating series by their absolute 
convergence. As the series of absolute values has positive terms, all criteria seen 
in Sect 1.3 apply. 


1.5 The algebra of series 


Two series Sak, Sk can be added, multiplied by numbers and multiplied 


k=0 k=0 
between themselves. The sum is defined in the obvious way as the series whose 


formal general term reads cy, = ax + Dp: 
Saat Yoh Slou+ hy) 
k=0 


Assume the series both converge or diverge, and write s = S- ak, i= S be 

k=0 k=0 
(s,t € RU {+00 }). The sum is determinate (convergent or divergent) whenever 
the expression s + t is defined. If so, 


S-( (an + by) =s+t 
k=0 


and the sum converges if s+ t € R, diverges if s + t = too. 
If one series converges and the other is indeterminate the sum is necessarily 
indeterminate. 
Apart from these cases, the nature of the sum cannot be deduced directly from 
the behaviour of the two summands, and must be studied case by case. 
[o.@) 


Let now » € R \ {0}; the series AS ax is by definition the series with gen- 
k=0 


[o-e) 
eral term Aa,;. Its behaviour coincides with that of > Ge Anyhow, in case of 


. k=0 
convergence or divergence, 


5 AaK = AS. 
k=0 
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In order to define the product some thinking is needed. If two series converge 
respectively to s,t € R we would like the product series to converge to st. This 
cannot happen if one defines the general term cz, of the product simply as the 
product of the corresponding terms, by setting cy = axbz. An example will clarify 
the issue: consider the two geometric series with terms ay = x and by = ar: Then 


NT 1 | 1 3 

ye —2 So == a 
k _t ’ k _t 

rare 1-4 <3 1-i 2 

while 

ae | nae | 1 6 3 

espa Ee 

mon a0” ie: B 


One way to multiply series and preserve the above property is the so-called 
Cauchy product, defined by its general term 


k 
Ck = ys ajbp—; = aobe + ay bg—-1 + +++ + @x—1b1 + axbo. (1.9) 
j=0 


By arranging the products agbm, (€,m > 0) in a matrix 


bo by bo 
ao agbo aob1 aob2 ates 
ar nee < 
ay abo ay,b1 a be 
ar 


ag agbo agby agbs 


yi 


each term c, becomes the sum of the entries on the Ath anti-diagonal. 
The reason for this particular definition will become clear when we will discuss 
power series (Sect. 2.4.1). 


[o-2) [o.2) 
It is possible to prove that the absolute convergence of So ax and S- be is 
es k=0 k=0 
sufficient to guarantee the convergence of S- Cr, in which case 
k=0 
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1.6 Exercises 


1. Find the general term a,, of the following sequences, and compute lim ay: 
Noo 


L 28 A 23 4 5 
~,-=,-,-,... b) -=,-,-—=,-—,... Ds Wy V2 85 Wy ad 
a) 9” 3” 4’ 5’ ) 3? Q’ 97” Q1? c) 2 , v2, V3, V4, 
2. Study the behaviour of the sequences below and compute the limit if this 
exists: 
n+9 
n— = ) = 0 b n— ) = 0 
A) @g=n(n—-1), we )oa say? 
2+ 6n? 
co) ee ea n>1 dd) a= vin n>0 
3n +n? 14a’ 
5n (- t= 1 n2 
e) On = Saal? n> 0 an = n=0 


g) Qn =arctandn, n>0 Qn =3+cosnr7, n>O 


nm COs nr 


i)] a, = 1+ (-1)"sin=, n>1 m= 120 
m) Qn =Vn+3— Jn, n>0 ay = SBE HO) n>1 
0) Gn = —3n+log(n+1)-logn, n>1 p) a, = n>1 
3. Study the behaviour of the following sequences: 
ee ee ia." — 
oo on (3) 33 
= 
io. = (a) ~ Gn = 2" sin(2 7) 


| 

~ 
Q 
o) 
nA 


[i) | on = a e) an =n! (cos =~ 1) 
n! 


4. Tell whether the following series converge; if they do, calculate their sum: 


»4(5) Dp ore, 
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) Stank (sing -sing5) 
si eel 
[| a p re: 


[5.] Using the geometric series, write the number 2.317 = 2.3171717... as a ratio 
of integers. 


6. Determine the values of the real number x for which the series below converge. 
Then compute the sum: 


CO k CO 
a) Le S > 3*(@ +2) 
k=2 k=1 
c) > = Ss" tan® x 
=i k=0 
Find the real numbers c such that 
Ste) * =2. 
k=2 


Co Co 
1 
Suppose S- ap (ax # 0) converges. Show that ye — cannot converge. 
k=1 i” 


9. Study the convergence of the following positive-term series: 


3 = ak 


k=1 k=1 
~ log k =a. il 

[e) a = aC ae, 
k=1 k=1 

1 — A 

> sing é) = 
1 k=0 

eis *. cos? k 

)d Vko +k? » kVk 
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~ 1 
10. Find the real numbers p such that » ——— converges. 
ra k (log 


ky 


[o-e) 
1 
[11.] Estimate the sum s of the series S- —— using the first six terms. 
= k2 +4 


12. Study the convergence of the following alternating series: 


k= k=0 

S- sin (ix + :) So(-1)* ((: + =) ‘ 
k=1 k=1 
= (=1)" 3k = ik 

é) 244k 1 2)" ke +1 


13. Check that the series below converge. Determine the minimum number n of 
terms necessary for the nth partial sum s,, to approximate the sum with the 
given margin: 


= (=1)e 

a) S- Ee? Tn|< 10-3 
k=1 

»! * ? rn| < 10-* 
k=1 : 
=“ (—1)*k 

c) 57! > rn| <2-1073 
k=1 


k=1 k=1 

= (=o) cos 3k 
ae ee 

k=1 k=1 

aa k “sinks 

—] k f 6 

oo (-1)F +15 R-1 = 10* 
yee bh) DT ED) SEH 

24 (k+ 14 24 (k+2)5 
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15. Study the convergence of the series: 


a) 3 (1 — cos =) 3 _ 


“1 fk = Re 
9 zn) yee 
k=1 k=1 
Sia ~ 3k—1 
'e) 2 A f) SCs 
1- - = -(2k — 1) = 2k+1 
16. Verify the following series converge and then — the sum: 
oO gk-1 
k 
a) (—1) 5k »)| os 9. a 
k=1 
2k +1 — 1 
i(k + 1 1) 2 BET I)k+3) 
k=1 k=0 
1.6.1 Solutions 
1. General terms and limits: 
a) a, = 9 ee Ly lim an = 1 
ntl i n—-oco 
b) Qn (yrs ,n>1, lim a =0 
@) Gy =A SO: lim a, = +oo 
2. Sequences’ behaviour and limit: 
a) Diverges to +00. b) Converges to 4. c) Converges to 6. 


d) Converges to 1. e) Diverges to +oo. 
2 


f) Si li 
) Since Me 


= 1, the sequence is indeterminate because 
(2n)? (2n +1)? 
l n= lim —-—— =-1l, Jit w= Jim a 
ae goes Ones 5g TE ees (Ont 121 
g) Converges to $. 


h) Recalling that cosna = (—1)”", we conclude immediately that the series is 
indeterminate. 


i) Since {sin +},,51 is infinitesimal and {(—1)"},>1 bounded, we have 
; ae 
lim (—1)"sin- =0, 
noo n 


hence the given sequence converges to 1. 
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¢) Since 


mcosn n n 1 
ne+1!/7 n2+17 n3~ n? 


by the Comparison Test we have 


m) Converges to 0. 
n) We have 


log(2+e”)  loge™"(1+2e-") 1 nm log(1 + 2e~) 
dn 4n 4 4n 
1 


so lim ayn = -. 
noo 


aN 


o) Diverges to —oo. p) Converges to 0. 


3. Sequences’ behaviour: 


a) Diverges to +00. b) Indeterminate. 
c) Recalling the geometric sequence (Example 1.1 i)), we have 


lim a, = lim GY 1) =-1; 
N—>00 N00 4r(4—- ete 1) 
and the convergence to —1 follows. 
d) Diverges to +oo. 
e) Let us write 


2n(2n—1)---(n+2)(n+1)_ 2n 2n-1 n+2 n+l 
=e Se es te 
n(n+1)---2-1 n n—-1 2 it 


25 


ie ae 


as lim (n+1) = +co, the Second Comparison Theorem (infinite case), implies 
noo 


the sequence diverges to +00. 
f) Converges to 1. 
g) Since 
On = exp = 2 log a 
ae +2 
we consider the sequence 


eae lee © =e lL . FB tog (1 - 2n+1 at). 


nm+n+2 m+n+2 
Note that 
1 2n+1 9 
m0 n2t+n+2 : 
so 
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Thus 
; . Vn?+2(2n+1) _ Qn? 
lim 6, = — im ——~-——— = —- lim — = -2; 
n+ 00 noo n2+tn+2 noo nN 


and the sequence {a,,} converges to e~?. 


h) Setting 2 =27"7, we have x + OT for n > 00, so 


sin x 


lim a, = lim 7 = 
noo x—O0t XL 
and {a,,} tends to 7. 
i) Observe 
m+ 7 T 7 
co = =cos{(=~+2z=—]=-—sin=; 
2 2n 2n 
therefore, setting x = 3, we have 
: ’ oo T sin x T 
lim a, = — lim nsin— =— lim = =-— 
n—00 n—00 2n z30+ 2 2 2 


so {a,} converges to —7/2. 
£) Converges to —1/2. 


4. Series’ convergence and computation of the sum: 


a) Converges with sum 6. 
b) Note 
li = lim —=20. 
aes ei arose k+5 7 
Hence the series does not converge, in fact it diverges to +oo. 
c) Does not converge. 


d) The series is telescopic; we have 


nal ee Cre nee ace = 
n= —_— _ —— — ered ——sin 
Ss sin oe 5 sea a ue mee 
= sin 1 — sin . 
m+ 1 
As lim s, =sin1, the series converges with sum sin 1. 
Noo 
e) Because 
Ore i) = (2) 1 1 i 
aa a) a a), =a a 
k=0 ° k=0 ° k=0 ? l= 3 Img 2 


the series converges to 7/2. 


f) Does not converge. 
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5. Write 
= a ar 17 1 1 
2.317 =2.34+ —7+ — + —74+...=2.34+—5(14+—,4+—7+.--. 
* 708 * 708 * To? * tp(taptipt ) 
17 1 i 23, 17 100 
=23+—,S —> =23+ —,— 7, ==—+—. — 
* 708 <= 10" * 1 1 wz 10 1000 99 


_ 23 | 17 1147 
~ 10 990 495 ° 


6. Series’ convergence and computation of the sum: 


a) Converges for |z| < 5 and the sum is s = sa: 


b) This geometric series has gq = 3(x + 2), so it converges if |3(a + 2)| < 1, ie., if 
LE (—£, —2). For x in this range the sum is 


1 32 +6 
S es — 


“12=3@3-2) ~" "3945" 


c) Converges for z € (—o0, —1) U(1, +00) with sum s = +. 
d) This is a geometric series where q = tanz: it converges if | tanz| < 1, that is 
if x € neg (—¥ + ka, F + kr). For such x, the sum is s = ——. 


7. This is a geometric series with q = a which converges for |1 +c] > 1, ie., for 


c< —2orc> 0. If so, 


ete ek = 
2 °) 1-4 l-—c c(l+c) 


= 2, we obtain c = =1v3 But as the parameter c varies within 
-1+V3 
ae 


Imposing nee] 


(—oo, —2) U (0, +00), the only admissible value is c = 
8. As » dp converges, the necessary condition jim ay = 0 must hold. Therefore 
— oo 


k=1 
foe) 


‘ Ds ; 
lim — is not allowed to be 0, so the series — cannot converge. 
k-0co Ak 4 a 


9. Convergence of positive-term series: 


a) Converges. 


b) The general term a; tends to +co as k — oo. By Property 1.6 the series 
diverges to +o0. Alternatively, one could invoke the Root Test 1.15. 
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By the Ratio Test 1.14: 


- Ak+1 3k k! 
lim ca 


coe Ge eo (RAIA 


writing (k + 1)! = (k + 1)k! and simplifying, we get 


‘ Qk4+1 
lim = im —= 
k-+00 Ap ko k +1 


The series then converges. 
Using again the Ratio Test 1.14: 


k 
det _ (k+1)! kF (A) -t< 


li = i ee a 
ted Ge ke (K+1)*+1 kl! pare 


tells that the series converges. 
As 
ay ~ k= for ko ow, 
we conclude that the series diverges, by the Asymptotic Comparison Test 1.12 
and the fact that the harmonic series diverges. 


Converges. 
Note logk > 1 for k > 3, so that 


logk 1 
> 3 b> 3. 
k; k; ~ 
The Comparison Test 1.10 guarantees divergence. 
Alternatively, we may observe that the function f(x) = 10g is positive and 


continuous for « > 1. The sign of the first derivative shows f is decreasing 
when x > e. We can therefore use the Integral Test 1.17: 
+00 } al l 2\¢ 
po ae = tim [PE ae = tim SREE 
3 


x cH +Ho }3 x c+>+00 


3 
(loge)? _ (log 3)? _ 


es “Yiny Se es 


c—+oo 2 2 


then conclude that the given series diverges. 


Converges by the Asymptotic Comparison Test 1.12, because 


— ~ a k + +00 
Qh] ° QF" 
[o2) 
and the geometric series S- aR converges. 

k=1 
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i) Diverges by the Asymptotic Comparison Test 1.12, because 


; 1 
sn-~ —, k + +00, 
k 
d the harmonic series y — diverges. 
an Ty 1 T1 a4 i 1V TS 
() Diverges. m) Converges. 


n) Converges by the Comparison Test 1.10, as 


cos? k au icechee 
kVk 7 kVk’ 
and the generalised harmonic series >. 732 converges. 
k=1 
10. Converges for p > 1. 
+00 
11. Compute / f(x) dx where f(x) = = z is a positive, decreasing and con- 
tinuous map on [0, +00): 
oo a er a 1 
A aaa q te = 5 [arctan S| | = 7 — 5 arctan s 
Since 
=-4f- .+ — = 0.7614 
S6 + ig + + 4 ; 
using (1.8) 


+00 +oo 
sot [ fla)ar <s<s+ f f(x) dz, 
7 6 
and we find 0.9005 < s < 0.9223. 


12. Convergence of alternating series: 
a) Converges conditionally. b) Does not converge. 
c) Since 


1 1 
sin (km + ra = cos(k7) sin = (=1)"sin—, 


the series is alternating, with 6; = sin i. Then 


lim 6; = 0 and be+1 a Dies 
k- oo 


By Leibniz’s Test 1.20, the series converges. It does not converge absolutely 


since sin ¢ ~ + for k — oo, so the series of absolute values is like a harmonic 


series, which diverges. 
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d) 
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Converges absolutely: by the relationship (1 + 2)* — 1 ~ aa, for x — 0, it 
follows 


k> ow. 


1\/2 V2 
k 

Bearing in mind Example 1.11 i), we may apply the Asymptotic Comparison 
Test 1.12 to the series of absolute values. 


Does not converges. 
2 


maa It is straightforward to check 


This is an alternating series with b, = 
lim bh = 0. 
k—- oo 


That the sequence bz is decreasing eventually is, instead, far from obvious. To 
show this fact, consider the map 


x 
f(z) i e3 ae 1 ’ 
and consider its monotonicity. Since 
fe) = a) 
(a + de 


and we are only interested in z positive, we have f’(xr) < 0 if 2-2? < 0, 
i.e., x > 2. Therefore, f decreases on the interval (¥/2,+00). This means 
f(k+1) < f(k), so be4i1 < by for k > 2. In conclusion, the series converges by 
Leibniz’s Test 1.20. 


13. Series’ approximation: 


a) 
b) 


c) 


n=d. 
The series is alternating with b, = a Immediately we see jim by = 0, and 
, — oo 


bp41 < by for any k > 1 since 


gk+l gk 2 
b => SS —=b0 ——- — 1 — k 1. 
EL a ca 7 


Imposing bp+1 < 10~? = 0.01, one may check that 


8 2 
by = — ~ = 0.02, bs = 375 = 9.006 < 0.01. 


The minimum number of terms needed is n = 7. 


n=d. 


14. 


a) 


o 
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Absolute convergence: 


There is convergence but not absolute convergence. In fact, the alternating 
series converges by Leibnitz’s Test 1.20, whereas the series of absolute values 
is generalised harmonic with exponent a = $ mo, 


Does not converge. 


The series converges absolutely, as one sees easily by the Ratio Test 1.14, for 
example, since 


a. Dip ’ ane | 
lim — = lm —— 


k00 «(by k-00 Ch + 1)! , Qk 


ii ea 
(sha. 


Convergence is absolute, since the series of absolute values converges by the 
Comparison Test 1.10: 


k3 


cos =| 


Converges, but not absolutely. 
Converges absolutely. 
Does not converge since its general term does not tend to 0. 


Converges absolutely. 


. Convergence of series: 


Converges. 
Observe 
sink 1 
72 <p for allk > 0; 
CO 
the series S- BR converges, so the Comparison Test 1.10 forces the series of 
k=1 


absolute values to converge, too. Thus the given series converges absolutely. 
Diverges. 


This is an alternating series, with b, = /2—1. The sequence {bi }u>1 decreases, 
as 2 > “‘/2 for any k > 1. Thus we can use Leibniz’s Test 1.20 to infer 
convergence. The series does not converge absolutely, for 


log 2 
V2 —1 = el87/*§ 1 ~ : ko ow, 


just like the harmonic series, which diverges. 
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Note 


a k! 1.2.3 k aie 
w- {agehoOeed),: ~ & 5 Meo 4a 
since k 9) 
= a2 
a a 
The convergence is then absolute, because S- bx converges by the Comparison 
k=1 


Test 1.10 (it is bounded by a geometric series with q = 2 he 


Does not converge. 


. Checking series’ convergence and computing the sum: 


—1/7. 
Up to a factor this is a geometric series; by Example 1.3 iii) we have 


=. BF icy i i 3 
Peer: son 16 } 6 


(notice that the sum starts from index 1). 


It is a telescopic series because 


2k+1 7 1 1 ; 
R(k+12 kb (k+12° 
so 
1 
Sn = a ’ 
(n+ 1)? 


and then s = lim s, = 1. 
n—- oo 
lye. 


2 


Series of functions and power series 


The idea of approximating a function by a sequence of simple functions, or known 
ones, lies at the core of several mathematical techniques, both theoretical and 
practical. For instance, to prove that a differential equation has a solution one 
can construct recursively a sequence of approximating functions and show they 
converge to the required solution. At the same time, explicitly finding the values 
of such a solution may not be possible, not even by analytical methods, so one idea 
is to adopt numerical methods instead, which can furnish approximating functions 
with a particularly simple form, like piecewise polynomials. It becomes thus crucial 
to be able to decide when a sequence of maps generates a limit function, what sort 
of convergence towards the limit we have, and which features of the functions in 
the sequence are inherited by the limit. All this will be the content of the first part 
of this chapter. 

The recipe for passing from a sequence of functions to the corresponding series 
is akin to what we have seen for a sequence of real numbers; the additional com- 
plication consists in the fact that now different kinds of convergence can occur. 
Expanding a function in series represents one of the most important tools of 
Mathematical Analysis and its applications, again both from the theoretical and 
practical point of view. Fundamental examples of series of functions are given by 
power series, discussed in the second half of this chapter, and by Fourier series, to 
which the whole subsequent chapter is dedicated. Other instances include series of 
the classical orthogonal functions, like the expansions in Legendre, Chebyshev or 
Hermite polynomials, Bessel functions, and so on. 

In contrast to the latter cases, which provide a global representation of a func- 
tion over an interval of the real line, power series have a more local nature; in fact, 
a power series that converges on an interval (xo — R, xo + R) represents the limit 
function therein just by using information on its behaviour on an arbitrarily small 
neighbourhood of 29. The power series of a functions may actually be thought of 
as a Taylor expansion of infinite order, centred at xo. This fact reflects the intu- 
itive picture of a power series as an algebraic polynomial of infinite degree, that 
is a sum of infinitely many monomials of increasing degree. In the final sections 
we will address the problem of rigorously determining the convergence set of a 
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power series; then we will study the main properties of power series and of series 
of functions, called analytic functions, that can be expanded in series on a real 
interval that does not reduce to a point. 


2.1 Sequences of functions 


Let X be an arbitrary subset of the real line. Suppose that there is a real map 
defined on X, which we denote f, : X — R, for any n larger or equal than a 
certain no > 0. The family {fn}n>n. is said sequence of functions. Examples 
are the families f,(x) = sin(a + 1), n> 1 ot jn(e) =x", wv > 0,00 xX =. 

As for numerical sequences, we are interested in the study of the behaviour of 
a sequence of maps as n — oo. The first step is to analyse the numerical sequence 
given by the values of the maps f, at each point of X. 


Definition 2.1 The sequence {fn}n>n, converges pointwise at T ¢ X if 
the numerical sequence {fn(Z)}n>no converges asn— co. The subset AC X 
of such points & is called set of pointwise convergence of the sequence 
{fn}nsny- Phis defines a map f : A — R by 


Lie (ee Vae A. 
noo 


We shall write f, > f pointwise on A, and speak of the limit function f of 
the sequence. 


Note f is the limit function of the sequence if and only if 


lim |fn(x) — f(#)| =0, Vee A. 


nN oco 


Examples 2.2 
i) Let fn(x) = sin (x + 1), n > 1, on X =R. Observing that 2 + 4 +> © as 
n — oo, and using the sine’s continuity, we have 
1 
f(x) = lim sin(z+—) =sinz, Ve ER, 

noo n 
hence A= X = R. 
ii) Consider f,(x) = x", n > 0, on X =R,; recalling (1.1) we have 


0 if-l<a<l 
vc) = lim 2” = : 
He) . ifv@=1. 


The sequence converges for no other value x, so A = (—1,1]. 


(2.1) 


The notion of pointwise convergence is not sufficient, in many cases, to transfer 
the properties of the single maps f,, onto the limit function f. Continuity (but also 
differentiability, or integrability) is one such case. In the above examples the maps 
fn(x) are continuous, but the limit function is continuous in case i), not for ii). 
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A more compelling convergence requirement, that warrants continuity is passed 
onto the limit, is the so-called uniform convergence. To understand the difference 
with pointwise convergence, let us make Definition 2.1 explicit. This states that 
for any point x € A and any ¢ > 0 there is an integer 7 such that 


Wis op BS. SS gle) — Fe) < 2: 


In general 7% depends not only upon « but also on g, ie., 7 = R(e, x). In other 
terms the index 7, after which the values f,(x) approximate f(x) with a margin 
smaller than ¢, may vary from point to point. For example consider f,(x) = x”, 
with 0 < x < 1; then the condition 


fn(x) — F(a)| = |e" — Ol = 2" <e 


holds for any n > ee. Therefore the smallest n for which the condition is valid 


tends to infinity as « approaches 1. Hence there is no 7 depending on ¢ and not 
on &. 

The convergence is said uniform whenever the index 7 can be chosen inde- 
pendently of «. This means that, for any ¢ > 0, there must be an integer 7 such 
that 


Yn>n,n>n = (|fr(e)—f(a)|<e, Vane A. 
Using the notion of supremum and infimum, and recalling ¢ is arbitrary, we can 


reformulate as follows: for any ¢ > 0 there is an integer 7 such that 


Yn >n,n>n => sup|fn(x) —f(e)| <e. 
ZEA 


Definition 2.3 The sequence {fn}n>n, converges uniformly on A to the 
limit function f if 
lim sup |fn(z) — f(x)| = 0. 


noo rEA 


Otherwise said, for any € > 0 there is an T% = (Ee) such that 


ite age at —— |e a (| ee Vae A. 


We will write fn > f uniformly on A. 


Let us introduce the symbol 


Ilg|loo,a = sup |g(z)|, 
rEA 


for a bounded map g : A — R; this quantity is variously called infinity norm, 
supremum norm, or sup-norm for short (see Appendix A.2.1, p. 521, for a compre- 
hensive presentation of the concept of norm of a function). An alternative definition 
of uniform convergence is thus 


lim |[fn — flloo,a = 0. (2.3) 
n—-oco 
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Clearly, the uniform convergence of a sequence is a stronger condition than 


pointwise convergence. By definition of sup-norm in fact, 


Vee A, — |fn() — f(a) < Ilfn — flloo,a 


so if the norm on the right tends to 0, so does the absolute value on the left. 
Therefore 


Proposition 2.4 If the sequence {fn}n>n, converges to f uniformly on A, 


it converges pointwise on A. 


The converse is false, as some examples show. 


Examples 2.5 


i) The sequence f,,(x) = sin (+ +), n > 1, converges uniformly to f(x) = sinx 
on R. In fact, using known asta identities, we have, for any x € R, 


1 1 
a <9 a 
sin || cos (2 + =] | sin mn? 


for example. Therefore 


1 
sin (a+ — ) — sin] = = 2 
n 


moreover, equality is attained for x = —z, 


1 1 
sin («+ -) — sina} = 2sin — 
n 2n 


[fn — flloo,.e = sup 
xcER 
and passing to the limit for n + oo we obtain the result (see Fig. 2.1, left). 


ii) As already seen, fn(x) = 2", n > 0, does not converge uniformly on I = (0, 1] 
to f defined in (2.1). For any n > 0 in fact, ||fn—flloo,r7 = sup £” = 1 (Fig. 2.1, 
O<a<l 


right). Nevertheless, the convergence is uniform on every sub-interval J, = (0, al, 
0<a< 1, for 
lfn — flloo,7, = sup |2”-O|=a"—>0 asn- oo. 
x€[0,a] 
Therefore the sequence converges to zero uniformly on J,. More generally one can 
show the sequence converges uniformly to zero on any interval [—a,a],0 <a <1. 


The following criterion is immediate to check, and useful for verifying uniform 
convergence. 


Proposition 2.6 Let the sequence {fn}n>n. converge pointwise on A to a 
function f. Take a numerical sequence {Mn}n>no, infinitesimal for n — ov, 
such that 


lfn(z) — f(x)| < Mn, Vee A. 
Then fn > f uniformly on A. 


The property has been used in the previous examples, with M, = 2sin = for 
case i), and M,, = a” for ii). 
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Ya 


Ya 


sin x 


Figure 2.1. Graphs of the functions f, and their limit f relative to Examples 2.5 i) 
(left) and ii) (right) 


2.2 Properties of uniformly convergent sequences 


As announced earlier, under uniform convergence the limit function inherits con- 
tinuity from the sequence. 


Theorem 2.7 Let the sequence of continuous maps {fn}n>no converge uni- 


formly to f on the real interval I. Then f is continuous on I. 


Proof. By uniform convergence, given ¢ > 0, there is an % = N(e) > no such that 
for any n > 7 and any x € I 
E 


nlx) — f(0)| < 5 


Fix xp € J and take n > 7. As fy, is continuous at xo, there is a 6 > 0 such 
that, for each x € I with |x — xo| < 6, 


\fn(@) — falwo)| < 5. 
Let therefore x € I with |x — xo| < 6. Then 
|f(x) — f(xo)| < |F(®) — fn(@)| + |fn(@) — fn(vo)| + 
+|fn(to0) — f(xo)| < statone, 
O 


so f is continuous at x. 


This result can be used to say that pointwise convergence is not always uniform. 
In fact if the limit function is not continuous while the single terms in the sequence 
are, the convergence cannot be uniform. 
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2.2.1 Interchanging limits and integrals 


Suppose f, — f pointwise on I = [a,b]. If the maps are integrable, it is not true, 
in general, that 


[ fulwyae > [ tee. 


Example 2.8 
Let fn(x) = en?e-™ on I = (0, 1]. Then f,,(2) > 0 = f(x), as n > oo, pointwise 


1 
on I. Therefore i. f(x) dx = 0; on the other hand, setting y(t) = te~* we have 
0 


fi fal) dz = [ y(nz)n dx = [ y(t) dt = —ne“" + [-e-]? 


=-—ne"-e "+11 fornow. 


Uniform convergence is a sufficient condition for transferring integrability to 
the limit. 


Theorem 2.9 Let I = |a,b] be a closed, bounded interval and {fn}n>no @ 
sequence of integrable functions over I such that fn — f uniformly on I. 
Then f is integrable on I, and 


[ ears f see as 2 — oOo. (2.4) 


Proof. The integrability of the limit function is immediate if each f,, is continuous, 
for in that case f itself is continuous by the previous theorem. In general, 
one needs to approximate the functions by means of step functions, as 
prescribed by the definition of an integrable map; the details are left to 
the reader’s good will. 

In order to prove (2.4), let us fix « > 0; then there exists an 7 = N(e) > no 
such that, for any n > 7% and any x € J, 


Ee 
b=a 


|fn(a) — F(#)| < 


Therefore, for all n > 7%, we have 


[ hide [5 aby 


< f ime)-s@lae< f GS 


b 
/ (fa(e) — f(a) de 


dt =<, oO 
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Note that (2.4) can be written as 


showing that uniform convergence allows to exchange the operations of limit and 
integration. 


2.2.2 Interchanging limits and derivatives 


When considering limits of differentiable functions, certain assumptions on uniform 
convergence guarantee the differentiability of the limit. 


Theorem 2.10 Let {fn}n>no be a sequence of C' functions over the interval 
I = [a,b]. Suppose there exist maps f and g on I such that 


i) fn — f pointwise on I; 
ii) fi + g uniformly on I. 


Then f isC! on I, and f' = g. Moreover, fn > f uniformly on I (and clearly, 
fi, > f' uniformly on I). 


Proof. Fix an arbitrary xo € J and set 
Fle) = fla0) + f g(at. (2.5) 


We show first that fn f uniformly on J. For this, let ¢ > 0 be given. By 
i), there exists ny = n1(€;20) > no such that for any n > n1 we have 
E 


Jato) ~ ft0)| <5 
By it), there is ng = ne(E) > no such that, for any n > nz and any t € [a, 5], 
i) 20) < oa 
Note we may write each map fy, as 
ful) = fale) + fo fu(tyar, 
xo 
because of the Fundamental Theorem of Integral Calculus (see Vol. I, 
Cor. 9.42). Thus, for any n > 7 = max(n1,n2) and any z € |a, b], we have 


Fulte) — f (xo) +f (f,,(t) | g(t)) dt 


xO 


lFn(x) _ f(2)| = 
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x 


< |fnlvo) — F(eo)| + f \fa(t) — g(t] at 


TO 


b 
E real a ae 
ee 
<s+ap—a | 73° 


Therefore fn — f uniformly on J, hence also pointwise, by Proposition 
2.4. But fn — f pointwise on J by assumption; by the uniqueness of the 
limit then, f coincides with f. From (2.5) and the Fundamental Theorem 
of Integral Calculus it follows that f is differentiable with first derivative g. 


Oo 


Under the theorem’s assumptions then, 


li (ae — ( lim fala) Va el, 


Noo N—-> Co 


so the uniform convergence (of the first derivatives) allows to exchange the limit 
and the derivative. 


Remark 2.11 A more general result, with similar proof, states that if a sequence 
1 ti bag OFC ! maps satisfies these two properties: 


i) there is an x € |a, b| such that lim falto) =f € RB; 


ii) the sequence { f/ }n>n. converges uniformly on J to a map g (necessarily con- 
tinuous on J), 


then, setting 


x 


fle)=e+ f g(tat, 


0 


fn converges uniformly to f on J. Furthermore, f € C! and f’(x) = g(x) for any 
x € [a, bj. Oo 


Remark 2.12 An example will explain why mere pointwise convergence of the 
derivatives is not enough to conclude as in the theorem, even in case the sequence 
of functions converges uniformly. Consider the sequence f,(2) = x — «"/n; it 
converges uniformly on J = [0,1] to f(x) = x because 


ge” 


fal) — fe) =|= 


1 
<-, Vael, 
n 


and hence F 
ie = Flee oT asm — oo. 


Yet the derivatives f’ (x) = 1-—2x"~! converge to the discontinuous function 


2.3 Series of functions Al 
1 ifze€/{0,1), 
g(x) = { 1) 
0 ifx#=l1. 


So {f/}n>0 converges only pointwise to g on [0,1], not uniformly, and the latter 
does not coincide on J with the derivative of f. Oo 


2.3 Series of functions 


Starting with a sequence of functions {f,}%>x%. defined on a set X C R, we can 
CO 


build a series of functions y f, in a similar fashion to numerical sequences and 
k=ko 
series. Precisely, we consider how the sequence of partial sums 


s(x) = D> fe(2) 


k=ko 
behaves as n — oo. Different types of convergence can occur. 


[e.) 
Definition 2.13 The series of functions S- f, converges pointwise at T 
k=ko 
if the sequence of partial sums {5n}n>ko converges pointwise at Z; equivalently, 


co 
the numerical series S- fx (Z) converges. Let A C X be the set of such points 


k=ko 
CO 


z, called the set of pointwise convergence of S- fe; we have thus defined 
k=ko 
the function s: A + R, called sum, by 


The pointwise convergence of a series of functions can be studied using at each 
point « € X what we already know about numerical series. In particular, the 
sequence {fx(x)}x>x must be infinitesimal, as k — oo, in order for x to belong 
to A. What is more, the convergence criteria seen in the previous chapter can be 
applied, at each point. 


[e.) 
Definition 2.14 The series of functions So fy, converges absolutely on 
k=ko 


[e-@) 
A if for any x € A the series S- |fx(x)| converges. 
k=ko 
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CO 


Definition 2.15 The series of functions 3S fy, converges uniformly to 
k=ko 


the function s on A if the sequence of partial sums {Sn }n>ko converges uni- 
formly to s on A. 


Both the absolute convergence (due to Theorem 1.24) and the uniform conver- 
gence (Proposition 2.4) imply the pointwise convergence of the series. There are 
no logical implications between uniform and absolute convergence, instead. 


Example 2.16 


CO 
The series ye is nothing but the geometric series of Example 1.3 where 
k=0 
q is taken as independent variable and re-labelled x. Thus, the series converges 
pointwise to the sum s(x) = ~+ on A = (—1,1); on the same set there is absolute 
convergence as well. As for uniform convergence, it holds on every closed interval 
[—a, a] with 0 <a< 1. In fact, 


Lg 1 a ae 
Sele) 8) = 1l-z agi oe oa. 
where we have used the fact that |x| < a implies 1 — a < 1-2. Moreover, 
the sequence M,, = nie tends to 0 as n — ov, and the result follows from 


Proposition 2.6. 


It is clear from the definitions just given that Theorems 2.7, 2.9 and 2.10 can 
be formulated for series of functions, so we re-phrase them for completeness’ sake. 


Theorem 2.17 Let {fi}n>k. be a sequence of continuous maps on a real 
[e.) 


interval I such that the series ye fr converges uniformly to a function s on 
k=ko 
I. Then s ts continuous on I. 


Theorem 2.18 (Integration by series) Let I = [a,b] be a closed bounded 


(oe) 


interval and { fr}k>ko a Sequence of integrable functions on I such that Ss" fr 
k=ko 
converges uniformly to a function s on I. Then s is integrable on I, and 


[ sear= [> near s [ fewee. (2.6) 


k=ko 7% 
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This is worded alternatively by saying that the series is integrable term by term. 


Theorem 2.19 (Differentiation of series) Let {fx }x>x, be a sequence of 
C' maps on I = [a,b]. Suppose there are maps s and t on I such that 


DY) SG) S30) ees 


tno 


ii) SE fi, (x) =t(x), Va ETI and the convergence is uniform on I. 


k=ko 
[e-@) 
Then s € C\(I) and s' =t. Furthermore, » fr converges uniformly to s on 


k=ko 
CO 


I (and SS fi, converges uniformly to s‘). 
k=ko 


That is to say, 


and the series is differentiable term by term. 


The importance of uniform convergence should be clear by now. But checking 
uniform convergence is another matter, often far from easy. Using the definition 
requires knowing the sum, and as we have seen with numerical series, the sum is not 
always computable explicitly. For this reason we will prove a condition sufficient 
to guarantee the uniform convergence of a series, even without knowing its sum in 
advance. 


Theorem 2.20 (Weierstrass’ M-test) Let {feo be a sequence of maps 
on X and {Mx}k>ko @ sequence of real numbers such that, for any k > ko, 


ree hil SS Vie Ve ex. 


[ee) CO 
Assume the numerical series y My, converges. Then the series y fr con- 
k=ko k=ko 


verges uniformly on X. 
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CO 
Proof. Fix x € X, so that the numerical series 2; |fx(x)| converges by the 
k=ko 
Comparison Test, and hence the sum 


s(c)= So f(z), WaeX, 
k=ko 


is well defined. It suffices to check whether the partial sums {sn }n>xk, 
converge uniformly to s on X. But for any x € X, 


|sn(x) — s()|=| D> fe(a))< S> |fe(@)l< So Me, 
k=n+1 k=n+1 k=n-+1 
1.e., 
sup |s,(x) — s(x)| < Zz; M,, 
of k=n+1 


(oe) 
As the series S- M;, converges, the right-hand side is just the nth re- 
k=ko 
mainder of a converging series, which goes to 0 as n > oo. 
In conclusion, 
lim sup |s,(x) — s(x)| =0 
ints noo rex 


S\e) > fx converges uniformly on X. Oo 
k=ko 


Example 2.21 


We want to understand the uniform and pointwise convergence of 
"sin k*a 
S- ; xeR. 


Note 
sin k*x 1 


ieee 
kV/k |” kk 


(oe) 
we may then use the M-test with M; = re since the series S- 
k=1 


Ve ER; 


73/2 converges 


(it is generalised harmonic of exponent 3/2). Therefore the given series converges 
uniformly, hence also pointwise, on R. 


2.4 Power series 


Power series are very special series in which the maps fz are polynomials. More 
precisely, 
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Definition 2.22 Fix x € R and let {ax}n>0 be a numerical sequence. One 
calls power series a series of the form 


(oe) 


S| ax (x — 20)* = ap + a1 (a — 0) + a2(x — 20)? + °°: an) 
k=0 


The point xo is said centre of the series and the numbers {ax}xn>o0 are the 
series’ coefficients. 


The series converges at its centre, irrespective of the coefficients. 


The next three examples exhaust the possible types of convergence set of a 
series. We will show that such set is always an interval (possibly shrunk to the 
centre). 


Examples 2.23 


i) The series 


[o-2) 

Skee =—2¢+4e? 4 27234... 

k=1 
converges only at x = 0; in fact at any other x 4 0 the general term k*x" 
is not infinitesimal, so the necessary condition for convergence is not fulfilled 
(Property 1.6). 


ii) Consider 


asd k 2 3 

xv 4b XL 
) te ay aes (2.8) 
k=0 


it is known as exponential series, because it sums to the function s(x) = e®. 


This fact will be proved later in Example 2.46 i). 
The exponential series converges for any x € R. Indeed, with a given x 4 0, the 
Ratio Test for numerical series (Theorem 1.14) guarantees convergence: 

gers fl 
(k+1)! ok 


iii) Another familiar example is the geometric series 


lim = lim ae 
k—+00 k>00k+1 


=; Va €R\ {0}. 


Soe alt+et+e? +e +--- 
k=0 


(recall Example 2.16). We already know it converges, for x € (—1,1), to the 


function s() = ch. 


In all examples the series converge (absolutely) on a symmetric interval with 
respect to the centre (the origin, in the specific cases). We will see that the con- 
vergence set A of any power series, independently of the coefficients, is either a 
bounded interval (open, closed or half-open) centered at xo, or the whole R. 
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We start by series centered at the origin; this is no real restriction, because the 


substitution y = © — Xo allows to reduce to that case. 


Before that though, we need a technical result, direct consequence of the Com- 
parison Test for numerical series (Theorem 1.10), which will be greatly useful for 


power series. 


[e.2) 
Proposition 2.24 If the series SS ape", E #0), has bounded terms, in par- 


k=0 
foe) 


ticular if it converges, then the power series y aya 
k=0 


any @ such ian |a| = |x), 


converges absolutely for 


Proof. As az%, is bounded, there is a constant M > 0 
For any x with |z| < |z| then, 


k k 
% pe [ a 
lana | = ARE” (=) — M \=| ) 
x x 


[oe 
k 
But |x| < |Z|, so the geometric series > (=) 
XL 


k=0 
co 


by the Comparison Test, >, a,x" converges absolutely. 


k=0 


Example 2.25 


=< 1 
The series y x* has bounded terms when x = 
5 k+1 


such that 
0. 


Vk >0. 


converges absolutely and, 


+1, since 


k-1 
< 
al <1 


any k > 0. The above proposition forces convergence when |x| < 1. The series 


does not converge when || < 1 because, in case x = 4 
infinitesimal. 


t1, the general term is not 


Proposition 2.24 has an immediate, yet crucial, consequence. 


[o-2) 
Corollary 2.26 If a power series ae converges at x; # O, it con- 


k=0 
verges absolutely on the open interval (—|x1|,|x1|); 


if it does not converge 


at x2 #0, it does not converge anywhere along the half-lines (|x2|, +00) and 


(—00, —|2|). 
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no convergence 


convergence 


Figure 2.2. Illustration of Corollary 2.26 


The statement is depicted in Fig. 2.2, for x; > 0 and x2 < 0. 


Now we are in a position to prove that the convergence set of a power series is 
a symmetric interval, end-points excluded. 


CO 


Theorem 2.27 Given a power series Sore only one of the following 
holds: hae 


a) the series converges at x = 0 only; 


b) the series converges pointwise and absolutely for any x € R; moreover, it 
converges uniformly on every closed and bounded interval |a, }|; 


c) there is a unique real number R > 0 such that the series converges 
pointwise and absolutely for any |x| < R, and uniformly on all intervals 
[a,b] C (—R, R). Furthermore, the series does not converge on |x| > R. 


CO 


Proof. Let A denote the set of convergence of =, anu™. 


If A = {0}, we have case a). = 

Case b) occurs if A = R. In fact, Corollary 2.26 tells the series converges 
pointwise and absolutely for any x € R. As for the uniform convergence 
on [a, bj, set L = max(|a|, |b). Then 


Lele) = lage"| = lap", Va € |[a, 6]; 


and we may use Weierstrass’ M-test 2.20 with My = |ax|L*. 

Now suppose A contains points other than 0 but is smaller that the whole 

line, so there is an © ¢ A. Corollary 2.26 says A cannot contain any x with 

|x| > |Z|, meaning that A is bounded. Set R = sup A, so R > 0 because A 

is larger than {0}. Consider an arbitrary x with |z| < R: by definition of 
(oe) 


supremum there is an x; such that |x| < 21 < Rand z, apart converges. 
k=0 

Hence Corollary 2.26 tells the series converges pointwise and absolutely at 

x. For uniform convergence we proceed exactly as in case b). At last, by 

definition of sup the set A cannot contain values x > R, but neither values 
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x < —R (again by Corollary 2.26). Thus if |x| > R the series S- apx" does 


k=0 
not converge. Oo 


CO 
Definition 2.28 One calls convergence radius of the series Sane the 


k=0 
number 


[oe) 
sie e ER: Ss anu converges} ; 
k=0 


Going back to Theorem 2.27, we remark that R = 0 in case a); in case b), 
R = +00, while in case c), R is precisely the strictly-positive real number of the 
statement. 


Examples 2.29 


Let us return to Examples 2.23. 
co 


i) The series > k®c* has convergence radius R = 0. 


k=1 
ii) For > Fr the radius is R = +00. 
k=0 
[o2) 
iii) The series Ss" a* has radius R = 1. Oo 
k=0 
Beware that the theorem says nothing about the behaviour at x = +R: the 


series might converge at both end-points, at one only, or at none, as in the next 
examples. 


Examples 2.30 


i) The series 


Ys 
5 
k=1 k 
converges at x = +1 (generalised harmonic of exponent 2 for x = 1, alternat- 
ing for = —1). It does not converge on |x| > 1, as the general term is not 


infinitesimal. Thus R = 1 and A = [-1, 1]. 


ii) The series 

y= 

=" 
converges at x = —1 (alternating harmonic series) but not at x = 1 (harmonic 
series). Hence R = 1 and A= [-1,1). 
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iii) The geometric series 
(oe) 
d 2 
k=1 


converges only on A = (—1,1) with radius R = 1. 


Convergence at one end-point ensures the series converges uniformly on closed 
intervals containing that end-point. Precisely, we have 


Theorem 2.31 (Abel) Suppose R > 0 is finite. If the series converges at 


x = R, then the convergence is uniform on every interval |a, R| Cc (—R, RI. 
The analogue statement holds if the series converges at x = —R. 


If we now center a power series at a generic 2, the previous results read as 
[o2) 


follows. The radius R is 0 if and only if S- ap(x — x0)" converges only at a9, while 
k=0 

R = +oo if and only if the series converges at any x in R. In the remaining case 

R is positive and finite, and Theorem 2.27 says the set A of convergence satisfies 


{e@ER:|x—a2o|< R} CAC{xe#ER: |x —2o| < R}. 


The importance of determining the radius of convergence is evident. The next 
two criteria, easy consequences of the analogous Ratio and Root Tests for numerical 
series, give a rather simple yet useful answer. 


Theorem 2.32 (Ratio Test) Given the power series 


CO 
Ss" an(x — rea 
k=0 


with az #0 for all k > 0, if the limit 


: Ak+1 
lim =e 
k-co| Ak 


exists, the radius of convergence R is given by 


0 if £= +00, 


ffO<L<+00. 
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Proof. For simplicity suppose zo = 0, and let x ~ 0. The claim follows by the 
Ratio Test 1.14 since 


QAk+1 
Qk, 


|x| = 2|2| . 


k-> oo 


When ¢ = +00, we have ¢|z| > 1 and the series does not converge for any 
x #0, so R =0; when @=0, ¢|z| = 0 < 1 and the series converges for any 
x, so R=-+oo. At last, when @ is finite and non-zero, the series converges 
for all x such that ¢|x| < 1, so for |x| < 1/€, and not for |z| > 1/@; therefore 
= 17s, O 


Theorem 2.33 (Root Test) Given the power series 


if the limit 


exists, the radius R is given by formula (2.9). 


The proof, left to the reader, relies on the Root Test 1.15 and follows the same 
lines. 


Examples 2.34 


i) The series ka” has radius R = 1, because lim Vk = 1; it does not 


k—- oo 
k=0 
converge for x = 1 nor for x = —1. 


ii) Consider 


and use the Ratio Test: 


(k+1)! ke. k\e i. 
fae (k+1)'+! k! ae 5 7 : 


The radius is thus R = e. 
iii) To study 


Yoo IE (eo) (2.10) 
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set y = (2 — 2)? and consider the power series in y 


(oe) 


il 
= y* (19) 
— aa \(k+ (E—-D(k +2)" 
centred at the origin. Since 
2k+1 
lim = 


ko Vl (k—1(k+2) _ 
the radius is 1. For y = 1 the series (2.11) reduces to 
3 2k+1 
fae Lee) 
which diverges like the harmonic series (ah ~ 2, k + ov), whereas for 
y = —1 the series (2.11) converges (by Leibniz’s Test 1.20). In summary, (2.11) 
converges for -—l1 <y <1. 
Going back to the variable x, that means —1 < (a — 2)? < 1. The left inequality 


is always true, while the right one holds for —1 < x—2 < 1. So, the series (2.10) 
has radius R = 1 and converges on the interval (1,3) (note the centre is ro = 2). 


iv) The series 0° 
Soc aartatar+ao tay... 


is a power series where infinitely many coefficients are 0, and we cannot substitute 
as we did before; in such cases the aforementioned criteria do not apply, and it 
is more convenient to use directly the Ratio or Root Test for numerical series. 
In the case at hand 


(k+1)! 
foe ee =—| = lim |a|+!-4 
k—-00 M bie k—-0o 
is 0 if eae, 
= fim lel =) 400. if [2] > 1. 
Thus R = 1. The series converges neither for x = 1, nor for x = —1. 


v) Consider, for a € R, the binomial series 


a ft 
k 
a 
» (4) 
k=0 
If a= n EN the series is actually a finite sum, and Newton’s binomial formula 


(Vol. I, Eq. (1.13)) tells us that 


Y (pj ate 
k 
k=0 
hence the name. Let us then study for a € R \ N and observe 

O20] fala v-fo= wf 
ies) (k +1)! jaia—1)--(a—-k+| k+1" 
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therefore 


= .21)| an Or 


ko |(2)]—sk00 RL 


and the series has radius R = 1. The behaviour of the series at the endpoints 
cannot be studied by one of the criteria presented above; one can prove that the 
series converges at x = —1 only for a > 0 and at x = 1 only for a> —1. 


2.4.1 Algebraic operations 


The operations of sum and product of two polynomials extend in a natural manner 
to power series centred at the same point x29; the problem remains of determining 
the radius of convergence of the resulting series. This is addressed in the next 
theorems, where 2% will be 0 for simplicity. 


Theorem 2.35 Given }; = ae ine Lay = Sire of respect- 
k=0 


k=0 
foe) 


ive radii Ri, Ro, their sum ) = So (ax aL by) a* has radius R satisfying 


k=0 
R>min(Ri, Ro). If Ri # Ro, necessarily R = min(R, Ro). 


Proof. Suppose R; ~ Rg; we may assume R; < Rg. Given any point x such that 
R, <a < Rg, if the series 4’ converged we would have 


2 = > axa" = So (ax + by) a* = > bis = J} — dig, 
k=0 k=0 k=0 


hence also the series 4’; would have to converge, contradicting the fact 
that + > R,. Therefore R = R, = min(R;, Ro). 

In case Ry = Ro, the radius R is at least equal to such value, since the 
sum of two convergent series is convergent (see Sect. 1.5). Oo 


In case Ry = Ro, the radius R might be strictly larger than both R,, R2 due 
to possible cancellations of terms in the sum. 


Example 2.36 


The series 
Se 2 op So 
21) eet and = 2. OR 


though, has radius R = 4. 
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The product of two series is defined so that to preserve the distributive property 
of the sum with respect to the multiplication. In other words, 
(a9 + aya + asa ts. .)(bo + bya + bear es: s) 
= agbo + (aob1 + a1bo)a + (aobz + a1b1 + agbo)a* +... , 


L.€., 
(So axa") (So bo) = ae (2.12) 
k=0 k=0 k=0 
where 


k 
Cc. = ) eo ae 
j=0 


This multiplication rule is called Cauchy product: putting x = 1 returns precisely 
the Cauchy product (1.9) of numerical series. Then we have the following result. 


Co [o-e) 
Theorem 2.37 Given ); = apx® and Sy = ye b,a* , of respective radii 
k=0 


k=0 
R,, Ro, their Cauchy product has convergence radius R > min(R1, Re). 


2.4.2 Differentiation and integration 


Let us move on to consider the regularity of the sum of a power series. We have 
already remarked that the functions f,(2) = ax(a — 2o)* are C© polynomials 
over all of R. In particular they are continuous and their sum s(x) is continuous, 
where defined (using Theorem 2.17), because the convergence is uniform on closed 
intervals in the convergence set (Theorem 2.31). Let us see in detail how term-by- 
term differentiation and integration fit with power series. For clarity we assume 
xo = 0 and begin with a little technical fact. 


Lemma 2.38 The series }>, = Saey UE So = as kapx™ have the same 


radius of convergence. 


Proof. Call Ri, R2 the radii of }7,, 5°, respectively. Clearly Ro < Ry (for laxx*| < 
|ka,a*|). On the other hand if |z| < R, and Z satisfies |z| < 7 < Ry, 


the series > ao" converges and so |az|z* is bounded from above by a 
k=0 
constant M > 0 for any k > 0. Hence 
x \k xk 
kana*| = klax|a* |=| < Mk|=| . 
e x 
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= k 
Since Sok (=) is convergent (Example 2.34 i)), by the Comparison 
E 
k=0 


[o-e) 
Test 1.10 also S> kag converges, whence Ry < Rg. In conclusion 
k=0 
R, = Rz and the claim is proved. Oo 
[o) [o) [o.) 
The series So kaya* = So kapa* = Sok + 1)ag412" is the derivatives’ 
k=0 k=1 k=0 
[o-) 
series of y aga”. 
k=0 


[o.e) 

Theorem 2.39 Suppose the radius R of the series Ss a,x" is positive, finite 
k=0 

or infinite. Then 


a) the sum s is aC® map over (—R, R). Moreover, the nth derivative of s 


[e@) 
on (—R, R) can be computed by differentiating So axa" term by term n 


k=0 
times. In particular, for any x € (—R, R) 


(k+ Dapuic’: (2.13) 


Proof. a) By the previous lemma a power series and its derivatives’ series have 


(oe) (oe) 
. 1 . 

identical radii, because \ kaa = —S0 kapa" for any x # 0. The 

k=1 oo 
derivatives’ series converges uniformly on every interval [a,b] Cc (—R, R); 
thus Theorem 2.19 applies, and we conclude that s is C1 on (—R, R) and 
that (2.13) holds. Iterating the argument proves the claim. 

[o.@) 


b) The result follows immediately by noticing > a," is the derivatives’ 
fore) Z k=0 

series of > —* k+l These two have same radius and we can use The- 

= k+1 


orem 2.18. Oo 


2.4 Power series tat) 


Example 2.40 


Differentiating term by term 


= 1 
ko — 
ye =— os ee) (2.15) 
k=0 
we infer that for x € (—1,1) 
Co CO iI 
i — ee 2.16 
2 = 2 Gap ae 


Integrating term by term the series 
[o2) 


So(-1)'a* = : # € (1,1), 


= ge 
obtained from (2.15) by changing x to —2, we have for all x € (—1,1) 
=D)" gp ge 
S- Baa = = log(1+ 2). (2.17) 
k=0 k=1 
At last from 
— il 
k2k _ 
X(-9) x = Lage x € (-1,1), 


obtained from (2.15) writing —2? instead of x, and integrating each term separ- 
ately, we see that for any x € (—1,1) 


> 2 a7*t! — arctana. (2.18) 


i) 
ca 
+ 
— 


[e.2) 
Proposition 2.41 Suppose S an(x —zxo)* has radius R > 0. The series’ 


k=0 
coefficients depend on the derivatives of the sum s(x) as follows: 


1 
= i s) (x9) , Ve = 0. 


Proof. Write the sum as s(x) = >. an(«— xo)"; differentiating each term k times 
h=0 
gives 
s) (x) = S~h(h—1)-++(h—k + 1)an(a — a0)"* 
h=k 


= doth +k)(h+k—1)---(h+ 1)anye(e — to)” . 
h=0 


56 2 Series of functions and power series 


For x = xo only the term indexed by h = 0 contributes, and the above 
expression becomes 


s*) (x9) = Ali VE=> 0. 4 


2.5 Analytic functions 


The previous section examined the properties of the sum of a power series, summar- 
ised in Theorem 2.39. Now we want to take the opposite viewpoint, and demand 
that an arbitrary function (necessarily C°) be the sum of some power series. Said 
better, we take f € C@(X), X CR, xp € X and ask whether, on a suitable interval 
(tp — 6,20 +6) C X with 6 > 0, it is possible to represent f as the sum of a power 
series 0° 


fa) = Ss" ap(x — x0)*; (2.19) 


by Proposition 2.41 we must necessarily have 


(k) 
a, = 2) veo. 
k! 
In particular when xo = 0, f is given by 
— f°*) (0) 
io=>, A ak, (2.20) 
k=0 


Definition 2.42 The series (2.19) is called the Taylor series of f centred 
at xo. If the radius is positive and the sum coincides with f around Xo (i.e., 


on some neighbourhood of xo), one says the map f has a Taylor series ex- 
pansion, or is an analytic function, at xo. If x9 = 0, one speaks sometimes 
of Maclaurin series of f. 


The definition is motivated by the fact that not all C° functions admit a power 
series representation, as in this example. 


Example 2.43 


Consider 
(Os ea 
0 ie= 0. 
It is not hard to check f is C® on R with f(0) = 0 for all k > 0. Therefore 
the terms of (2.20) all vanish and the sum (the zero function) does not represent 
f anywhere around the origin. 
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The partial sums in (2.19) are precisely the Taylor polynomials of f at xo: 


mC) (a 
su(2) = Yo OD (o — 06) = Thal): 


k=0 


Therefore f having a Taylor series expansion is equivalent to the convergence to 
f of the sequence of its own Taylor polynomials: 


lim: 6,(¢) = lim Tf. 2) =F (es Va € (ao — 6,20 + 0). 
In such a case the nth remainder of the series r,,(x) = f(x) — $,(a) is infinitesimal, 
as n — oo, for any x € (4p — 6,20 + 0): 


dim (olay =. 


There is a sufficient condition for a C°° map to have a Taylor series expansion 
around a point. 


Theorem 2.44 Take f € C“(xo — 6,40 + 6), 6 > O. If there are an index 
ko > 0 and a constant M > 0 such that 


k) 


\f)(x)| <M Va € (to — 6,20 + 6) (2.21) 


5k 


for all k > ko, then f has a Taylor series expansion at xq whose radius is at 
least 0. 


Proof. Write the Taylor expansion of f at xq of order n > ko with Lagrange 
remainder (see Vol. I, Thm 7.2): 


1 


(n+ ptr endl = aot . 


f(x) = T fn,xo (x) + 


where x, is a certain point between zp and x. By assumption, for any 
x € (xp — 6,20 + 6) we have 


1 
(n+ 1)! 


= n+1 
Ira(z)| = FO (@n)| be = agit <a (Eel) | 


If we assume |x — ro|/d< 1, then 


im yi2=0 


and the claim is proved. 
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Remark 2.45 Condition (2.21) holds in particular if all derivatives f‘)(x) are 
uniformly bounded, independently of k: this means there is a constant M > 0 for 
which 
[f"(e)| <M, Ve € (20 —6,29 +6). (2.22) 
In fact, from Example 1.1 v) we have x —+ oo as k > o, so te > 1 for k bigger 
or equal than a certain ko. 
A similar argument shows that (2.21) is true more generally if 


[fF (@))<M*, Va € (ao — 5,20 +4). (2.23) 


Examples 2.46 


i) We can eventually prove the earlier claim on the exponential series, that is to 
say 


Se k 
es x 
=0 


We already know the series converges for any x € R (Example 2.23 ii)); addition- 
ally, the map e* is C® on R with f(x) = e*, f“) (0) =1. Fixing an arbitrary 
6 > 0, inequality (2.22) holds since 

f(x) =e"<eP =M, Vare(-6,6). 
Hence f has a Maclaurin series and (2.24) is true, as promised. 
More generally, e” has a Taylor series at each x9 € R: 


x = ee 
e =) ae), VaeR. 
k=0 
ii) Writing —2x? instead of x in (2.24) yields 
2 cs 02h 
k=0 


Integrating term by term we obtain a representation in series of the error func- 
tion 


ofe=— | eee oe ie , 
Vt Jo Vn kL 2k +1 
which has a role in Probability and Statistics. 


iii) The trigonometric functions f(x) = sinx, g(x) = cos are analytic for any 
x € R. Indeed, they are C® on R and all derivatives satisfy (2.22) with M = 1. 
In the special case xp = 0, 


; = tal 
sing = S- peur geet Vc ER, (2.25) 
k=0 
love) = k 
cos x = S- a ae VaeR. (2.26) 
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iv) Let us prove that fora € R\N 


a ~ a k 
1+a2)°= os Vx € (-1,1). 2.21 
ata) = 5° (2) (1,1) (2.27) 
In Example 2.34 v) we found the radius of convergence R = 1 for the right-hand 
side. Let f(x) denote the sum of the series: 


f(x) = > ({) c®, = Wa € (-1,1). 


k=0 
Differentiating term-wise and multiplying by (1+ x) gives 


(1+2)f'(z) =(1+2) ew (E - -Yw+n(,4 ae +a(y ja! 


k=0 


“Eee sO) em 


=0 
Hence f’(x) = a(1+2)~'f(x). Now take the map g(x) = (1+ 2)~°f(x) and 


g(x) = —a(1 +.2)~-°" f(a) + (L+2)-°f"(z) 
= -a(l +a)" f(x) tall t+2)-°" f(a) = 
for any x € (—1,1). Therefore g(x) is constant and we can write 
f(x) =e + om 
The value of the constant c is fixed by f(0) = 1, soc=1. 


v) When a = —1, formula (2.27) gives the ns series of f(x) = =e at the 
origin: 


1 [oe) 
= 
l+a = 
Actually, f is analytic at all points x) #4 —1 and the corresponding Taylor series’ 


radius is R = |1+ ol, ie., the distance of zo from the singular point x = —1; 
indeed, one has 


f(a) = (—1)PR1(1 +) FD 
and it is not difficult to check estimate (2.21) on a suitable neighbourhood of zo. 
Furthermore, the Root Test (Theorem 2.33) gives 


R= lim 4/|1 + ao0|Ft! = |1+29| > 0. 
k—00 


vi) One can prove the map f(x) = ee is analytic at each point x9 € R. This 
does not mean, though, that the radius of the generic Taylor expansion of f is 


+oo. For instance, at xp = 0, 
[oe) 


f(x) = )0(-1)*0"" 
k=0 
has radius 1. In general the Taylor series of f at xo will have radius \/1+4 2 
(see the next section). 
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vi) The last two instances are, as a matter of fact, rational, and it is known that 
rational functions — more generally all elementary functions — are analytic at 
every point lying in the interior of their domain. 


2.6 Power series in C 


The definition of power series extends easily to the complex numbers. By a power 
series in C we mean an expression like 


CO 
a ar(z — zo)” : 
k=0 


where {a;,}%>0 is a sequence of complex numbers, zo € C and z is the complex 
variable. The notions of convergence (pointwise, absolute, uniform) carry over 
provided we substitute everywhere the absolute value with the modulus. 

The convergence interval of a real power series is now replaced by a disc in the 
complex plane, centred at z) and of radius R € [0, +oo]. 

The term analytic map determines a function of one complex variable that is 
the sum of a power series in C. Examples include rational functions of complex 
variable 


where P, Q are coprime polynomials over the complex numbers; with zo € dom f 
fixed, the convergence radius of the series centred at z) whose sum is f coincides 
with the distance (in the complex plane) between zo and the nearest zero of the 
denominator, i.e., the closest singularity. 

The exponential, sine and cosine functions possess a natural extension to C, 
obtained by substituting the real variable x with the complex z in (2.24), (2.25) and 
(2.26). These new series converge on the whole complex plane, so the corresponding 
functions are analytic on C. 


2.7 Exercises 


1. Over the given interval I determine the sets of pointwise and uniform conver- 
gence, and the limit function, for: 


[9)| fale) “Tenn cP) 
[b)] Jn) = TyGRge T= 10, +00) 


fr(x) =e™, I=R 
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d) jae) —=nae™, I=R 


qnrex 


©) In{@) = Srapgre? =o T= R 


Study uniform and pointwise convergence for the sequence of maps: 


fn(x) = na(1 — 2)", x €[-1,1]. 


Does the following formula hold? 


1 1 
lim fn(x) dx = | lim f(x) dx 
0 0 


n> oco n> Co 


Study uniform and pointwise convergence for the sequence of maps: 
fn(x) = arctan nz , ceER. 


Tell whether the formula 


noo 


1 1 
lim fn(x) da = / lim Jolt) de 
holds, with a=0 ora = 1/2. 


4. Determine the sets of pointwise convergence of the series: 


fo) Soe Py] 0+ z) 
Dae a> 0 [))] Sa x #2 
fe] eo * sin 7 > (e- k?=1) 


k=1 
Co 
Determine the sets of pointwise and uniform convergence of the series y ee 
k=2 


Compute its sum, where defined. 


Setting f,(x) = cos Z, check Se fh (2) converges uniformly on |—1, 1], while 
k=1 


S- f(a) converges nowhere. 
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7. Determine the sets of pointwise and uniform convergence of the series: 


ee 2 vos hy" = G 4 0?) Pe _ 1] 


[o2) 
Knowing that s a,4* converges, can one infer the convergence of the following 
k=0 
series? 


a) S~ ax(—2)* b) So ax(—4)4 
k=0 k=0 


[o.@) 
9. Suppose that Ss" a,x" converges for x = —4 and diverges for x = 6. What can 


be said about the convergence or divergence of the following series? 


0) Yas b) at! 
k=0 k=0 
c) S > ax(—3)4 d) S ‘(-1)*a,9" 
k=0 k=0 
Let p be a positive integer. Determine, as p varies, the radius of convergence 
of = 
57 
7x". 
<— (pk)! 


11. Find radius and set of convergence of the power series: 


co ak oo (=1)¥a* 
Dae 2 eer 


k=1 k=0 
c) Ske d) ee 
3* logk 
k=0 k=2 
= = Bia =1)* 
e) S° ka -4)4 f) ar ae 
k=0 k=0 
r, eee h) Sow (20 —1)* 
k=1 k=1 
i) = kak 0) = (—1)Fa2k-1 
; isSoG OR) 2k —1 


k=1 
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The function 


(—1)¥x 2k+1 
> 


is called Bessel function of order 1. egies its domain. 


Given the function 
Tie) =14+2Qe+a? 4203 +---= Sagat, 


where adgx = 1, d2n41 = 2 for any k > 0, determine the domain of f and an 
explicit formula for it. 


14. Determine the convergence set of the series: 
lee) k 
2 2 k 1+2 

] (GF) @-0 Ya) 

Soe: AL ps 1 39 
2) ree | yop Oe 
Bp k* +1 Oo) ea 

Determine the radius of convergence of the power series 


foe) 
) av * yk 
k=0 


as the real parameter a > 0 varies. 


16. Expand in Maclaurin series the following functions, computing the radius of 
convergence of a series thus obtained: 


1 
la) | £@) ==, [b)] Fle) =e 


17. Expand the maps below in Taylor series around the point x9, and tell what is 
the radius of the series: 


fa] ==, w= 
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Verify that 
S- ei we 
= ~ i= oP 


for |x| < 1. 


19. Write the first three terms of the Maclaurin series of: 


fa)| f() = 80= 9) b) f(z)=e* cose =o) f(z) = SE 


20. Write as Maclaurin series the following indefinite integrals: 
[suede b) [vite 


21. Using series’ expansions compute the definite integrals with the accuracy re- 
quired: 


1 
a) . sin x? dz, up to the third digit 
0 


1/10 
V1l+ae%dz, with an absolute error < 1078 


2.7.1 Solutions 


1. Limits of sequences of functions: 
a) Since f,(0) = 0 for every n, f(0) = 0; if x £0, 
NX 1 
(2)~ = = +0 fo > p00. 
fn(2) pee pee = 
The limit function f is identically zero on I. 
For the uniform convergence, we study the maps f, on J and notice 


n(1 — 2n3x3) 


/ = 
f(x) ~~ (1 + n3x3)? 
and f/(¢) =0 for 2= Tan with f(t) = rp (Fig. 2.3). Hence 
2 
sup fn(®)| = =z 
ee ( M 3V2 


and the convergence is not uniform on [0,+00). Nonetheless, with 6 > 0 fixed 
and n large enough, f,, is decreasing on [d, +00), so 


sup |fn(x)| = fn(d) > 0 as m— +00. 
xE[d, +00) 


The convergence is uniform on all intervals [6,-++co) with 6 > 0. 
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32 fi 


fo 
fe 


Figure 2.3. Graphs of f, and f relative to Exercise 1. a) 


b) Reasoning as before, the sequence converges pointwise on J to the limit f(#) = 


0, for any x € I. Moreover 
1 —n?2? 
/ 4 
Fl) ~~ (1 + n242)2 ’ 


and for any x > 0 


1 1 1 
Loa) =< a 7 with fil =) 5 


Thus there is uniform convergence on J, since 


1 
lim sup |fn(x)| = lim — =0. 


N00 yE[, +00) noo 2n 


See Fig. 2.4. 
c) The sequence converges pointwise on (—oo, 0] to 
0 ifx <0, 
fay = 4 
t iee='0, 


We have no uniform convergence on (—oo, 0] as f is not continuous; but on all 
half-lines (—oo, —6], for any 6 > 0, this is the case, because 


Figure 2.4. Graphs of f, and f relative to Exercise 1.b) 
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lim sup e™” = lim e 
n—0o x€(—00,—4] n—- oo 


me = 


d) We have pointwise convergence to f(x) = 0 for all x € [0,+00), and uniform 
convergence on every [6, +00), 6 > 0. 


e) Note fn(0) = 1/2. As n > oo the maps fy satisfy 


(4/3)"" ife<0, 
fle) | app ifa>0. 


Anyway for « 4 0 
f(z) = lim f,(z) =0. 
NCO 
Hence the sequence converges pointwise on R to the limit 


(0 ife 40, 
HO i 


The convergence is not uniform on R because the limit is not continuous on that 
domain. But we do have uniform convergence on every set As = (—oo, —6] U 
[5, +00), 6 > 0, since 


lim. sup |fn(2)] = lim max (fn(6), fa(—8)) = 0. 


noo we As 


2. Notice fn(1) = fa(—1) = 0 for all n. For any x € (—1, 1) moreover, 1 — x? < 1; 
hence 
lim Jnle) = 0; Va € (-1,1). 


Then the sequence converges to f(z) =0, Va € [-1,1]. 

What about uniform convergence? For this we consider the odd maps fn, so it 
suffices to take x € [0,1]. Then 
fi(z) =n(Q — 2?) — Ine? (1 — 2?)"") = n( — 2?)?-1 (1 — a? — 2nz?), 


n 


and f,(x) has a maximum point 7 = 1//1+ 2n (and by symmetry a minimum 
point « = —1//1+2n). Therefore 


wonton = fs (apg) = pte (te) 
gg) an) ean ee 


and the convergence is not uniform on [—1, 1], for 


lim ———— — : eo !/2 jim + 
1m = 1m = CO. 
noo 4/1 +2n \1+2n noo 4/1 + 2n 
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From this argument the convergence cannot be uniform on the interval [0, 1] either, 
and we cannot swap the limit with differentiation. Let us check the formula. Put- 
ting t = 1 — x”, we have 


1 1 
_ 2 Wie. * n 1 
Pew flay = Jin 5 | ee oma) oO 
while ; 
; lim (ede =O), 
0 nN co 
3. Since 
mi2 wed, 
im, joie 0 if~=0, 
noo 
—n/2 ifa <0, 


we have pointwise convergence on R, but not uniform: the limit is not continuous 
despite the f, are. Similarly, no uniform convergence on [0, 1]. Therefore, if we put 
a = 0, it is not possible to exchange limit and integration automatically. Compute 
the two sides of the equality independently: 


1 


1 
lim inie) da = lim (carctanna|, — | eae | ) 
0 0 


x 
n—- oo noo 1+ n2 7? 


log(1 272) 11 
lim arctan n — eaten) | 
no 


2n 0 


lim (arctan — 
no 


log(l+n?)\ 
2n a 
Moreover 

WT 


1 1 
/ Jim fn(2) dz = | = de aed 
Hence the equality holds even if the sequence does not converge uniformly on [0, 1]. 


If we take a = 1/2, instead, we have uniform convergence on [1/2,1], so the 
equality is true by Theorem 2.18. 


4. Convergence set for series of functions: 


a) Fix a, so that 
(k + 2)* 1 
f(x) = ———S= ; ko. 
( ) B4 Jk k3-« 
The series is like the generalised harmonic series of exponent 3 — x, so it 
converges if 3— x2 > 1, hence x < 2, and diverges if 3-—x <1, so x> 2. 
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b) Given x, use the Root Test to see 


k 
lim ‘/lf,(a)| = lim (1+ =) =e? 
k-> 00 k- 00 k 


Then the series converges if e” < 1, ie., x < 0, and diverges if e” > 1, so 
x > 0. If x = 0, the series diverges because the general term is always 1. The 
convergence set is the half-line (—oo, 0). 


c) If e =1, the general term does not tend to 0 so the series cannot converge. If 
x #1,as k > co we have 


k 


fete) ~{ ie 1, 


ak ifa<l. 


In either case the series converges. Thus the convergence set is (0,-+00) \ {1}. 


d) If |a| < 2, the convergence is absolute as | f;,(x)| ~ (l2l)* k > co. Ifa < —2or 
x > 2 the series does not converge since the general term is not infinitesimal. 
The set of convergence is (—2, 2). 


e) Observe that 
1 


The series converges absolutely if 1 — x > 1, so x < 0. By Leibnitz’s Test, the 
series converges pointwise if 0 < 1—a < l,ie.,0 <a < 1. It does not converge 
(it is indeterminate, actually) if > 1, since the general term does not tend 
to 0. The convergence set is thus (—oo, 1). 


f) Given x, the general term is equivalent to that of a generalised harmonic series: 


fe(2) = (—) - (=). ae. 


The convergence set is (1, +00). 


5. Geometric series with g = e”, converging pointwise on (—oo,0) and uniformly 
on (—oo, —6], for any 6 > 0. Moreover for any x < 0 


asd 1 Qa 
Soe?) = —l-e’= — 


~ [et 
k=2 


co 
: x x, 
6. For any x € R, lim cos— = 1 4 0. Hence the convergence set of y cos — is 
k— oo k kal k 
empty. 
As fj,(z) = —¢sin £, we have 
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(oe) 1 Co 

Since a, RB converges, the M-test of Weierstrass tells Ss" fi,(@) converges uni- 
k=1 k=1 

formly on {—1, 1]. 


7. Sets of pointwise and uniform convergence: 


a) This is harmonic with exponent —1/, so: pointwise convergence on (—1,0), 
uniform convergence on any sub-interval [a,b] of (—1, 0). 


b) The Integral Test tells the convergence is pointwise on (—oo, —1). Uniform 
convergence happens on every half-line (—oo, —6], for any 6 > 1. 
c) Observing 
1 2 2 


the series converges pointwise on R. Moreover, 
1 2 
sup | fx(2)| = f,,(0) = exp zp losk —1=M,. 
rE 


2 = 


The numerical series 3 M;, converges just like eo . The M-test implies 


k=1 k=1 
the convergence is also uniform on R. 


8. The assumption ensures the radius of convergence is bigger or equal than 4. 
Hence the first series converges, while for the second one we cannot say anything. 


9. Convergence of power series: 


a) Converges. b) Diverges. c) Converges. d) Diverges. 
10. We have 
ak (p(k + 1))! (kl)? (pk + 1)(pk + 2)--+ (pk +p)” 
Thus 
‘ Ak+1 k+1 k+1 k+1 1 
lim im oS, 
k-00 Ak ee pk+1 + 1 ok +2 + 2 pk + p pP 


and the Ratio Test gives R = p?. 


11. Radius and set of convergence for power series: 


a) R=1,I=[-1,1) b) R=1,1=(-1,]] 
c) R=1, I =(-1,1) d) R=3, I = (-3,3] 
e) R=1, I =(3,5) f) R= 10, I =(-9,11) 
2) R=3,I =(-6,0] h) R=0,1={3} 


i) R=+o0,I=R ()R=1,l=[-1,1]] 
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12. Using the Ratio Test: 
Ri(k-+- 127? 
= in —  _—_, = lim —d_, ~- 
k-s00 (k +1)! (k +2)! 22443 — k-00 4(k + 2)(k + 1) 
Hence R = +00 and the domain of the function is R. 
13. Since jim \/|ax| = 1, the radius is R = 1. It is straightforward to see the 
— co 


series does not converge for x = +1 because the general term does not tend to 0. 
Hence dom f = (—1,1), and for x € (—1,1), 


Qk+1 
ak 


lim 


k-> oo 


=: 


1 22 1+227 
2k 2k+1 
= +2 ) —— + = : 
f(z) rai a 1-22 1-2? 1 — x? 


14. Set of convergence: 
a) Put y =x? —1 and look at the power series 
Qk 
> (3) ¥ 
k=1 
in the variable y, with radius R,. Since 
aw EZ 2 
lim (=) == 
k—- oo 


ie 3. For y = +3 the series does not converge as the general term does not 


tend to 0. In conclusion, the series converges if —3 <g?-1le< 3. The first 


inequality holds for any x, whereas the second one equals x? < 2; The series 


converges on ( — /5 3). 


[o.) 
b) Let « £1; set y = +* and consider the series S- 
k=1 


1 
zy" in y. Since 


Vk 


1-2 


1 
lim ¢ = lim e #8” — 1 
aes Vk ae , 
we obtain R, = 1. For y = +1 there is no convergence as the general term 
does not tend to 0. Hence, the series converges for 
1+2 
-l< a <a 1.é., oe 
1l-—z 
“k+1 
: 2 ? : 
c) Write y = 2~* and consider the power series Ss" al y*. Immediately we 
k=1 
have R, = 1; in addition the series converges if y = —1 (like the alternating 


harmonic series) and diverges if y = 1 (harmonic series). Returning to the 
variable x, the series converges if —1 < 2-** <1. The left inequality is trivial, 
the right one holds when x # 0. Overall the set of convergence is R \ {0}. 
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2 © ok 
3 
d) When x 4 0 we set y = aaa = ona and then study 2 =u" Its radius 


equals Ry = 5, so it converges on [-$, <). 
Back to the x, we impose the conditions 


2 


1 x Ze 
le a 
3° 1+2 ae 
This is equivalent to 2x? <1, making ( — wee ¥2) \ {0} the convergence set. 


15. Exploiting the Ratio Test we have 


JVk+1 
: Qk+1 : a ; a va . a: ere 
li = lim = lim aV*t! Vie _ lim avtfitve = 1, 
k-0o}| Ap k00 gVk k—¥00 k—+00 


Hence R = 1 for any a> 0. 


16. Maclaurin series: 


a) Using the geometric series with ¢ = —$, 
3 Pe ee 0° ,akt3 
Salone =5h(- “ = d-1) OR+L? 
+2 k=0 k=0 
this has radius R = 2. 
b) With the geometric series where gq = x7, we have 
_ = 2k 
Lae eee bahia ; 


whose radius is R= 1. 


c) Expanding the function g(t) = log(1 +t), where t = —4, we obtain 


x — (—1)**t x\k at” 
= log3 + log(1 — =) = log3 —— (-=) SC ae 
f(a) = log3 + log(1 — 3) = log ps : ; og Dik 


This has radius R = 3. 


1 
d) Expanding g(t) = Gp (recall (2.16)) with t = , 
- 4 eee a\k SX (k+1)a*8 
ie) 16 (1— #2)? — Te +9 (5) = 2 Artz? 


Now the radius is R = 4. 
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e) Recalling the series of g(t) = log(1 +t), with t = x and then t = —z, we find 


co Gas ae 2 a oo ye 
f(a) = log(1 + x) — log(1 — 2) = $5 ~—— y= ale 

k=1 k=1 

= = 1)F+1 = 2 2k+1 

=S>=( = i 1) a* = SS aia 

= ran rar 2k+1 

thus the radius equals R = 1. 
es op Skt4 
f) ree = LY Gea has radius R = oo. 


g) Since sin? x = (1 — cos 2x), and remembering the expansion of g(t) = cost 
with t = 27, we have 


h) f(x) = . (log 2)" , with R= +00. 


k) 
k=0 


17. Taylor series: 


a) One can proceed directly and compute the derivatives of f to obtain 


f(x) = (yk , whence f() =(-1)¥k!, VEEN. 
Therefore ie 
fa) =>(-e-1h, R=. 
k=0 


ay, one could set t = x — 1 and take the Maclaurin series of f(t) = 
ae + ; to arrive at the same result: 


b) Here as well we compute directly 


fila) = 50? f(x) = (-1) ee 


for all k > 2; then 


2.7 Exercises 


= -3-5+--(2k—3 
f(a )= 242 4)+S0(-1 ett ACY: )5= (2k— (g ays 
k=1 


and the radius is R = 4. 
Alternatively, put t = x — 4, to the effect that 


t 
Jz=V44+t=2 lag 
=24 He-4) +29 (2) (5) 
k=2 
i Se = ee (SS ae 
=24+he-a +25 2a) te FA) mat J (e-4)! 
k=2 : 
Sy 1635+ Ok — 3) (-1)" k 
ye 
k=2 : 
- 1-3-5-+-(2k-8 
=24+-(%-—4)+ (—ayben 89 OE a — aye 
k=2 : 
c) ese} = 2)*, R=2 
<= kak 


CO (oe) (oe) (oe) 
2 
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18. The equality is trivial for x = 0. Differentiating term by term, for |x| < 1 we 
have 


1 1 2 
k k-1 _ 4) pk-2 — 
ey a s ka =o a? 2 k(k — 1)x aa ear 


From the last siete when x # 0, we have 


1 1 
_ oa 2_,k-1 pat = 2k 
(ia) y k(k + 1)x y Ke +5 kx =D a + ey 


k=1 k=1 k=1 k=1 


Therefore 


Dat = ee 
(l-2)} (1-2) (1—-2)3" 


19. Maclaurin series: 


a) Using the well-known series of g(x) = log(1 — x) and h(x) = e® yields 


2 rs) 2 rc) 
f(e) =e* log(1-2) = (1-e@+>-S+--)(-@->-5 
1 1 1 

=-r+a?—-2? gut 5m" su + 
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20. Indefinite integrals: 


a) Since 


a term-by-term integration produces 


5 oo (=1)°9"""" 
[sinz aa d (4k + 3)(2k +1)! 


with arbitrary constant c. 


4 [o2) 
a al _j\pai bed 0 R= 8). shag 
b) [vite de=c+a+ 3 =) 1) DERIK +1) at . 


21. Definite integrals: 


1 
a) | sina? dx ~ 0.310. 
0 


1/10 
b) V1+ 22 dx ~ 0.10001250. 
0 


3 


Fourier series 


The sound of a guitar, the picture of a footballer on tv, the trail left by an oil tanker 
on the ocean, the sudden tremors of an earthquake are all examples of events, either 
natural or caused by man, that have to do with travelling-wave phenomena. Sound 
for instance arises from the swift change in air pressure described by pressure waves 
that move through space. The other examples can be understood similarly using 
propagating electromagnetic waves, water waves on the ocean’s surface, and elastic 
waves within the ground, respectively. 

The language of Mathematics represents waves by one or more functions that 
model the physical object of concern, like air pressure, or the brightness of an 
image’s basic colours; in general this function depends on time and space, for in- 
stance the position of the microphone or the coordinates of the pixel on the screen. 
If one fixes the observer’s position in space, the wave appears as a function of the 
time variable only, hence as a signal (acoustic, of light, ...). On the contrary, if we 
fix a moment in time, the wave will look like a collection of values in space of the 
physical quantity represented (think of a picture still on the screen, a photograph 
of the trail taken from a plane, and so on). 

Propagating waves can have an extremely complicated structure; the desire to 
understand and be able to control their behaviour in full has stimulated the quest 
for the appropriate mathematical theories to analyse them, in the last centuries. 
Generally speaking, this analysis aims at breaking down a complicated wave’s 
structure in the superposition of simpler components that are easy to treat and 
whose nature is well understood. According to such theories there will be a collec- 
tion of ‘elementary waves’, each describing a specific and basic way of propagation. 
Certain waves are obtained by superposing a finite number of elementary ones, yet 
the more complex structures are given by an infinite number of elementary waves, 
and then the tricky problem arises — as always — of making sense of an infinite 
collection of objects and how to handle them. 

The so-called Fourier Analysis is the most acclaimed and widespread frame- 
work for describing propagating phenomena (and not only). In its simplest form, 
Fourier Analysis considers one-dimensional signals with a given periodicity. The 
elementary waves are sine functions characterised by a certain frequency, phase 
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and amplitude of oscillation. The composition of a finite number of elementary 
waves generates trigonometric polynomials, whereas an infinite number produces 
a series of functions, called Fourier series. This is an extremely efficient way of rep- 
resenting in series a large class of periodic functions. This chapter introduces the 
rudiments of Fourier Analysis through the study of Fourier series and the issues 
of their convergence to a periodic map. 

Beside standard Fourier Analysis, other much more sophisticated tools for ana- 
lysing and representing functions have been developed, especially in the last dec- 
ades of the XX century, among which the so-called Wavelet Analysis. Some of 
those theories lie at the core of the recent, striking success of Mathematics in sev- 
eral groundbreaking technological applications such as mobile phones, or digital 
sound and image processing. Far from obfuscating the classical subject matter, 
these latter-day developments highlight the importance of the standard theory as 
foundational for all successive advancements. 


3.1 Trigonometric polynomials 


We begin by recalling periodic functions. 


Definition 3.1 A map f :R — R is periodic of period T > 0 if 


ea Wo ok. 


If f is periodic of period T it is also periodic of period kT, with k € N \ {0}, 
including that f might be periodic of period T/k, k € N \ {0}. The minimum 
period of f is the smallest T (if existent) for which f is periodic. Moreover, f is 
known when we know it on an interval |[%o, 20 + T) (or (to, 20 + T]) of length T. 


Usually one chooses the interval [0, 7) or [ - a t). Note that if f is constant, it 
is periodic of period T,, for any T > 0. 
Any map defined on a bounded interval [a,b) can be prolonged to a periodic 
function of period T' = b — a, by setting 
fete y= Fle), k eZ, Vx € |a,b). 
Such prolongation is not necessarily continuous at the points 7 = a+ kT, even if 
the original function is. 


Examples 3.2 


i) The functions f(x) = cosa, g(x) = sina are periodic of period 2km with 
k € N \ {0}. Both have minimum period 27. 


ii) f(a) = cos = is periodic of period 87, 167,...; its minimum period is 87. 


iii) The maps f(x) = coswa, g(x) = sinwaz, w # 0, are periodic with minimum 
period To = an 


iv) The mantissa map f(x) = M(2) is periodic of minimum period To = 1. 
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The following properties are easy to prove. 


Proposition 3.3 Let f be periodic of period T > 0; for any xo € R then, 


[ tee = [Pr soar. 


In particular, if x» = —T/2, 


” F(a) di = i f@)\dx 
0 


25) 


Proof. By the properties of definite integrals, 
i xo xotT oe 
| f(e)ax = f Fla)ax + f Fla)ax + | Jia) de. 
0 0 LO xot+T 
Putting « = y+T in the last integral, by periodicity 


i foyae= [y+ trav f soray— fara, 


Gal 


whence the result. 


Proposition 3.4 Let f be a periodic map of period T; > 0 and take Tz > 0. 


Then g(a) — f( He) is periodic of period T>. 


Proof. For any x € R, 


g(a +T2) = f(F@+D)) = f(Fe+n) =1(F) = Ge) 


The periodic function f(#) = asin(wx + ¢), where a,w,f are constant, is 
rather important for the sequel. It describes in Physics a special oscillation of sine 
type, and goes under the name of simple harmonic. Its minimum period equals 
P= 2m and the latter’s reciprocal = is the frequency, i.e., the number of wave 
oscillations on each unit interval (oscillations per unit of time, if « denotes time); 
w is said angular frequency. The quantities a and ¢ are called amplitude and 
phase (offset) of the oscillation. Modifying a > 0 has the effect of widening or 
shrinking the range of f (the oscillation’s crests and troughs move apart or get 
closer, respectively), while a positive or negative variation of ¢ translates the wave 
left or right (Fig. 3.1). A simple harmonic can also be represented as asin(wx+¢) = 
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—20 =| 


—20 


Figure 3.1. Simple harmonics for various values of angular frequency, amplitude and 
phase: (w,a,) = (1,1,0), top left; (w,a,¢) = (5, 1,0), top right; (w,a,6) = (1,2,0), 


bottom left; (w,a,¢) = (1,1, $), bottom right 


acoswxz+ Bsinwxz with a = asing and 2 = acos ¢; the inverse transformation of 
the parameters is a = \/a? + 62, 6 = arctan 3 . . . 

In the sequel we shall concentrate on periodic functions of period 27 because 
all results can be generalised by a simple variable change, thanks to Proposition 
3.4. (More details can be found in Sect. 3.6.) 

The superposition of simple harmonics whose frequencies are all multiple of 
one fundamental frequency, say 1/27 for simplicity, gives rise to trigonometric 
polynomials. 


Definition 3.5 A trigonometric polynomial of order or degree n is a 
finite linear combination 


P(x) =ao + a1 cosx+b;sinz+...+a,,cosnz + b, sinnz 


=agt So (ax coskx + by sin kx) , 
pai 


where az, by are real constants and at least one of adn, bn is non-zero. Rep- 
resenting simple harmonics in terms of their amplitude and phase, a trigono- 
metric polynomial can be written as 


P(x) =ao+ Sy a, sin(kx + yx). 
k=1 
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The name stems from the observation that each algebraic polynomial of degree 
n in X and Y generates a trigonometric polynomial of the same degree n, by 
substituting X = cosxz, Y = sinx and using suitable trigonometric identities. For 
example, p(X,Y) = X?+2Y? gives 


p(cosz,sinx) = cos® x + 2sin? x 
1 + cos 2x 9 L = cos 22 
2 


= COS ZX 


1 1 
1+ =cos# —cos2x% + — cos 2 cos 2¢ 
2 2 
1 1 
=1+ 5 COSH — Cos 2ar + q(cosx + cos 3x) 
3 1 
=1+ 7 CO8E — Cos 2a + 7 008 82 = P(x). 


Obviously, not all periodic maps can be represented as trigonometric polynomials 
(c.g., f(z) =e8™*). At the same time though, certain periodic maps (that include 
the functions appearing in applications) may be approximated, in a sense to be 
made precise, by trigonometric polynomials: they can actually be expanded in 
series of trigonometric polynomials. These functions are called Fourier series 
and are the object of concern in this chapter. 


3.2 Fourier Coefficients and Fourier series 


Although the theory of Fourier series can be developed in a very broad context, 
we shall restrict to a subclass of all periodic maps (of period 27), namely those 
belonging to the space Co,, which we will introduce in a moment. First though, a 
few preliminary definitions are required. 


Definition 3.6 A map f periodic of period 27 is piecewise continuous if 
it is continuous on |0, 27] except for at most a finite number of points xo. At 
such points there can be a removable singularity or a singularity of the first 
kind, so the left and right limits 


ee) =i (a) and ee ti 


aL = 
L>+Lo LL 


exist and are finite. 
If, in addition, 


(ao) = 5 (Flag) + fl@o)): 


at each discontinuity xo, f is called regularised. 
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Definition 3.7 We denote by Co, the space of maps defined on R that are 


periodic of period 27, piecewise continuous and regularised. 


The set Co, is an R-vector space (ie, af + Bg € Com for any a, € R and all 
f,g € Cor); it is not hard to show that given f, g € Co,, the expression 


27 


(f,g= f (x) g(a) da (3.2) 


0 


defines a scalar product on Co, (we refer to Appendix A.2.1, p. 521, for the 
general concept of scalar product and of norm of a function). In fact, 


i) (f, f) > 0 for any f € Co,, and (f, f) =0 if and only if f = 0; 
ii) (f, Q=(, f), for any FG © Coys 
iii) (af + Bg, h) =a(f, h)+6(g, h), for all f, g, h € Co, and any a,8 ER. 


The only non-trivial fact is that (f, f) = 0 forces f = 0. To see that, let x1,..., 2 
be the discontinuity points of f in [0,27]; then 


27 Ly x2 27 
0= Pleaz= f° Part | fear +... Pan: 
0) 0 t1 In 


As f is continuous on every sub-interval (2;, ri+1), we get f(x) = 0 on each of 
them. At last, f(x;) = 0 at each point of discontinuity, by (3.1). 
Associated to the scalar product (3.2) is a norm 


[fll =f, IY? = ( [ Li(@) Paw)” 


called quadratic norm. As any other norm, the quadratic norm enjoys the fol- 
lowing characteristic properties: 


i) |lfllo => 0 for any f € Co, and lfllo = 0 if and only if f =0; 
ii) llaf|l2 = lal ||flle, for all f € Co, and any a ER; 
iii) If + glle < Ifllo + llglla, for any f, g € Cor. 


The Cauchy-Schwarz inequality 


(fl <llfllaligie, = Veg € Con, (3.3) 


holds. 
Let us now recall that a family {f;,} of non-zero maps in C2, is said an ortho- 
gonal system if 


(fr; fx) =0 fork # £. 
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By normalising fre = for any k, an orthogonal system generates an or- 


ih 


thonormal system {fr}, 


1 ifk=é, 


(fas f= bu = { j fkZl. 


The set of functions 


f= {1, cos x, sinz,...,coskaz, sinke,... } 


(3.4) 
- { p(x) =coskr:k > o U { v(x) =sinkr ik > i} 
forms an orthogonal system in Cree whose associated orthonormal system is 
# = {— : : ee ee \ (3.5) 
COs x, sin x, coskx, —=sinkz,...¢. : 
Tie Fe Fa Fecal, 


This follows from the relationships 


21 20 
| cos? kde = [ sintkxdr=a, Vk>1 
0 0 


20 20 
i cos kx costrde = | sinke sintcdr =O, Vee pO ke 2, 
0 0 


Dye 
i coskx sinftdx=0, Vk, >0. 
0 


The above orthonormal system in Con plays the role of the canonical orthonor- 
mal basis {e,} of R”: each element in the vector space is uniquely writable as 
linear combination of the elements of the system, with the difference that now 
the linear combination is an infinite series. Fourier series are precisely the expan- 
sions in series of maps in om viewed as formal combinations of the orthonormal 
system (3.5). 

A crucial feature of this expansion is the possibility of approximating a map 
by a linear combination of simple functions. Precisely, for any n > 0 we consider 
the (2n + 1)-dimensional subset P,, C C2, of trigonometric polynomials of degree 
<n. This subspace is spanned by maps vy; and w, of F with index k < n, forming 
a finite orthogonal system F,. A “natural” approximation in P, of a map of Co, 
is provided by its orthogonal projection on P,,. 
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Definition 3.8 We call orthogonal projection of f € co on Pp, the ele- 
ment 3,7 © Pp, defined by 


ede) Oy) So (ax coska + by sin kx) , (3.7) 
k=1 


where az, by are 


f(x) coska dz , 


f(x)sinka dz , 


Note S;,,¢ can be written as 


=> aKpR(x) + > bee (x 


and the coefficients a;, b; become 


(fe k) . _ (f,w k) 
(wey oe) =O Oh ate) 


There is an equivalent representation with respect to the finite orthonormal 
system F,, made by the elements of F of index k <n; in fact 


ap = fork > 1. (3.9) 


Snp(x) = S) anGe(x) + > bude (a) 
k=0 kai 
where a . 
@x=(f, Gn) fork>0, 6b. =(f, dx) fork>1. (3.10) 


The equivalence follows from 


1 1 : ae 
ak = Gs va b= 7. (V, bil ) = ak 
Iexll Pella “° [alle Ieaale 


hence 


similarly one proves bxwp(x) = brew (x). 
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To understand the properties of the orthogonal projection of f on Py, we stress 
that the quadratic norm defines a distance on C2,: 


d(f, 9) =\lf —glle, f, 9 € Con. (3.11) 


The number || f — g||z2 measures how “close” f and g are. The expected properties 
of distances hold: 


i) d(f, g) > 0 for any f, g € Ca and d(f, g) = 0 precisely if f = g; 


ii) d(f, 9) =d(g, f), for all f, 9 € Con; 
iii) d(f, 9) <d(f, h) +d(h, g), for any f, g, h € Con. 


The orthogonal projection of f on P, enjoys therefore the following properties, 
some of which are symbolically represented in Fig. 3.2. 


Proposition 3.9 i) The function f — Sn,¢ is orthogonal to every element 
of Pn, and Sy is the unique element of P, with this property. 

it) Sip. is the element in Py with minimum distance to f with respect to 
(3.11), ie., 


= = min ||f — Pllc. 
If —Snjlla = min ILf — Plo 


Hence Sy, 18 the polynomial of Py, that best approximates f in quadratic 
norm. 
iii) The minimum square error ||f — S;,,f\2 of f satisfies 


n 


21 
lf — Sn fll2 = / If(a)2dx — 2a — Sr(a2 +02). (3.12) 


k=1 


Proof. This proposition transfers to C2, the general properties of the orthogonal 
projection of a vector, belonging to a vector space equipped with a dot 
product, onto the subspace generated by a finite orthogonal system. For 
the reader not familiar with such abstract notions, we provide an adapted 


proof. 
For i), let 


P(x) = S- Gxyn(2) + >> bebe (a) 
k=0 k=1 


be a generic element of P,,. Then f — P is orthogonal to every element in 
Pry if and only if, for any k <n, 


(f-Pype)=0 and (f—Pwe)=0. 
Using the orthogonality of yz, We, that is equivalent to 


(f,~ b) —Ge(Yr, Pr) =9 and (fF, &) — be (We, Ve) = 0. 


84 3 Fourier series 


Figure 3.2. Orthogonal projection of f € Con On Pn 


Hence @, = ax and by = by for any k <n, recalling (3.9). In other words, 
PS Bae 
To prove it), note first that 

If +3 =I IS +llolls+2,9), VE, 9 € Con 


by definition of norm. Writing || — P||? as ||(f — Sn,¢) + (Sn,¢ —P)||?, and 
using the previous relationship with the fact that f — S,~ is orthogonal 
to Sp, —P € Pp by 1), we obtain 


If — Pla =f — Sn slo + llSn.¢ — PIS; 


the equation is to be considered as a generalisation of Pythagoras’ Theorem 
to spaces with a scalar product (Fig. 3.2). Then 


If -—Plla 2 lf —Sn sll, VP €Pn, 


and equality holds if and only if S;,,¢— P = 0 if and only if P = S,,,¢. This 


proves ii). 
Claim iii) follows from 
eS Sagle =f =Sapet — Sag) 
— (f, ; im Sn,f) _ (Brits i an oye) — (f, f ia Bact) 
= |If12 -— F, Sn.) 
and 
27 n 
(f, Suz) =f f(@) Sn,s(@) dee = 2nag + wy (apt be). og 


i= 
At this juncture it becomes natural to see if, and in which sense, the polynomial 
sequence S,, ¢ converges to f as n —> oo. These are the partial sums of the series 


lee) 
ag + So (ax cos kx + b; sin ka) , 
k=1 
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where the coefficients az, by, are given by (3.8). We are thus asking about the 
convergence of the series. 


Definition 3.10 The Fourier series of f € Co, is the series of functions 
ao + So (ax coskx + by sin ka) , (3.13) 
k=1 


where ao, ax, by (K > 1) are the real numbers (3.8) and are called the Fourier 
coefficients of f. We shall write 


fant So (ax coskx + by sink) . (3.14) 
k=1 


The symbol © means that the right-hand side of (3.14) represents the Fourier 
series of f; more explicitly, the coefficients a; and b; are prescribed by (3.8). Due 
to the ample range of behaviours of a series (see Sect. 2.3), one should expect the 
Fourier series of f not to converge at all, or to converge to a sum other than f. 
That is why we shall use the equality sign in (3.14) only in case the series pointwise 
converges to f. We will soon discuss sufficient conditions for the series to converge 
in some way or another. 

It is possible to define the Fourier series of a periodic, piecewise-continuous 
but not-necessarily-regularised function. Its series coincides with the one of the 
regularised function built from f. 


Example 3.11 


Consider the square wave (Fig. 3.3) 
-1 if-7m<a2<0, 


fiz)=< 0 it =), 49, 


1 wo<a<7. 


By Proposition 3.3, for any k > 0 
20 T 
(x) coska da = f(z) coskadx=0, 

0 —T 


as f(x) cos ka is an odd map, whence ay, = 0. Moreover 
: ; 2.f" 
= = f(a) sinkede = = [ sinka dx 
—T WT Jo 


O if k is even, 


2 2 
= —(1—coska) = —(1—(-1)*) = 
ken ae ae ee i if k is odd. 
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Figure 3.3. Graphs of the square wave (top) and the corresponding polynomials 
Si,¢(x), So,¢(x), Sa1,¢(a) (bottom) 


Writing every odd k as k = 2m +1, the Fourier series of f reads 


Co 


4 1 
as: Ss" Ima ante + le. 


The example shows that if the map f € Con is symmetric, computing its Fourier 
coefficients can be simpler. The precise statement goes as follows. 


Proposition 3.12 If the map f € Coz is odd, 
Ge 04 Vk >0, 


2 TS 
== f f(x)sinka daz, Vie 
To 


If f is even, 
b, = 0, VES, 


Tv 9) AS 
i J (nde a, == f fia eos ka de. ee Ik 
0 0 


Proof. Take, for example, f even. Recalling Proposition 3.3, it suffices to note 
f(x) sin ka is odd for any k > 1, and f(x) coska is even for any k > 0, to 
obtain that 

21 wT 
(2) sin kode = fieismkody =0, Vk > 1, 
0 —T 
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and, for any k > 0, 


20 T 7 
(x) cos ka da | f(x) cos ka dx = 2 | f(x) coska daz. 
0 = 0 O 
Example 3.13 
Let us determine the Fourier series for the rectified wave f(z) = |sinz| 


(Fig. 3.4). As f is even, the b, vanish and we just need to compute a, for 
k > 0: 


ae 1 f* a 
ao = — jsina| de == | sinzdr = —, 
ae a Tw Jo 1 
i ae 
a == / sin 2x dx = 0, 
T Jo 


ak 
TW 


a i 1 [" 
= | sine cos kedz = -{ (sin(k +1)x —sin(k — i) da 
0 TE 


_ (4 —cos(k+1)r — 1—cos(k — vr) 


1 k+1 k-1 
0 if k is odd, 
= 4 Vim L. 
“ae =D if k is even, 


The Fourier series of the rectified wave thus reads 


ae 1 
ia a. oe 


_ 


—T 0 wT x 


Figure 3.4. Graphs of the rectified wave (top) and the corresponding polynomials 
S2,¢(x), S10, ¢(2), S'30,¢(x) (bottom) 
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3.3 Exponential form 


The exponential form is an alternative, but equivalent, way of representing a Four- 
ier series; it is more concise and sometimes easier to use, but the price for this is 
that it requires complex numbers. 

The starting point is the Euler identity (Vol. I, Eq. (8.31)) 


e’”” = cos6 + isind , 6 ER; (3.15) 
setting 6 = tkx, k > 1 integer, we may write coskx and sin kz as linear combin- 
ations of the functions e’**, e~***: 

a 1 tka —tikax : 1 tka —tkx 
coska = 5(e +e"), sink = >-(e —e '*). 


On the other hand, 1 = e’” trivially. Substituting in (3.14) and rearranging terms 
yields 


(3.16) 


where 


Expression (3.16) represents the Fourier series of f in exponential form, and the 
coefficients c, are the complex Fourier coefficients of f. 

The complex Fourier series embodies the (formal) expansion of a function f € 
Con with respect to the orthogonal system of functions e***, k € Z. In fact, 


on k=, 
(Cu eo") = TOK] = 
0 ifkdée, 
where ‘5 
(f, 9) = f(x)g(x) dx 
0) 


is defined on C. the set of complex-valued maps f = f, +ifj : R + C whose real 
part f; and imaginary part f,; belong to C2,. 
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For this system the complex Fourier coefficients of f become 


(f, et) 
Ck (etka, the) ( ) 


in analogy to (3.9). Since this formula makes sense for all maps in C3, equa- 
tion (3.16) might define Fourier series on Ci, as well. 

It is straighforward that f € Ce is a real map (f,; = 0) if and only if its 
complex Fourier coefficients satisfy c_, = Cz, for any k € Z. If so, the real Fourier 
coefficients of f are just 


an = Cis Og = Ch + Cog, by Wc, — C4), fork 2 i: (3.19) 


3.4 Differentiation 


Consider the real Fourier series (3.13) and let us differentiate it (formally) term 
by term. This gives 


ao + So (an coskxz + 6, sinkz) , 
k=1 
with 
ao = 0, Ak = kbz , Br = —kaz , for k > 1. (3.20) 


Supposing f € Co, is C! on R, in which case the derivative f’ still belongs to 
C2,, the previous expression coincides with the Fourier series of f’. In fact, 


1 27 : 1 
a0= 52 f(a) de = 5 (Fn) ~ £00) 
by periodicity. Moreover, for k > 1, integrating by part gives 
1 27 ; 
a, = — f(x) cos ka dx 
W JO 


QT 


1 
—|F(2) cos ka 


20 
me f(e)snke dg = kbp, 
1 JO 


and similarly, 6, = —kag. 
In summary, 


if Sok b, coskx — kay sinkz). (3,21) 


A similar reasoning shows that such representation holds under weaker hypotheses 
on the differentiability of f, e.g., if f is piecewise C! (see Definition 3.24). 
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The derivatives’ series becomes all the more explicit if we exploit the complex 
form (3.16). From (3.15) in fact, 
— e’? = (~ sin) + icos6 = i(cos6 + isin#) = ie”. 


Therefore 


whence 


If f is C” on R, r > 1, this fact generalises in the obvious way: 


+00 
gO me S* (hy cue 5 


k=—oo 


Thus, the kth complex Fourier coefficient +, of the rth derivative of f is 


Vk = (tk) "ce. (3.22) 


3.5 Convergence of Fourier series 


This section is devoted to the convergence properties of the Fourier series of a 
piecewise-continuous, periodic map of period 27 (not regularised necessarily). We 
shall treat three kinds of convergence: quadratic, pointwise and uniform. We omit 
the proofs of all theorems, due to their prevailingly technical nature. 


3.5.1 Quadratic convergence 


We begin by the definition. 


Definition 3.14 Let f and fz, k > 0, be square-integrable functions defined 


on a closed and bounded interval |a, b]. The series Ss" fr converges in quad- 
k=0 
ratic norm to f on |a, 6] if 


tie. fe) Do sneo[ee= jm l= 
¢ k=0 k=0 


' A classical text where the interested reader may find these proofs is Y. Katznelson’s, 
Introduction to Harmonic Analysis, Cambridge University Press, 2004. 
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CoO 

Directly from this, the uniform convergence of ye fy to f implies the quadratic 
k=0 


convergence, for 


i 


n 


Fee) fala) aes f ( sup [fle - Yo fala) ao 
k=0 a k=0 


xeE[a,b 


< o-o)|F- Dal 
k=0 a" 


hence, if the last expression is infinitesimal as n — ov, so is the first one. 


Quadratic convergence of the Fourier series of a map in C2, is guaranteed by 
the next fundamental result, whose proof we omit. 


Theorem 3.15 The Fourier series of f € Cox converges to f in quadratic 
norm: 


as If - Sn.f l2 ah 


Let us describe some consequences. 


Corollary 3.16 Any f € Co, satisfies Parseval’s formula: 


+oo0 


PEE 
[fly ae = 2nai + 0 (a + 82). 
0 


k=1 


Proof. This is an easy corollary of the above theorem. By (3.12) in fact, 


2 n 
0= lim ||f —Snfll2= lim (| [f(2)? dx — maj — + > (az + 82) 
al 


n—-+0o0 n—-+0o 


27 oo 
= If (a)|? da — 2mag — w S(ag + bf). 
0 


k= 


Corollary 3.17 (Riemann-Lebesgue Lemma) Given f € Ge, 


lim ap= lim bh =O. 
k—>+00 k—>+00 


[o.@) 
Proof. From Parseval’s identity (3.23) the series So (ag +b;) converges. Therefore 
k=1 
its general term a? + be goes to zero as k + +00, and the result follows. 
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If f € Cx, is expanded in complex Fourier series (3.16), Parseval’s formula 
becomes 


on +00 
| Ra) 2 dx = 2m S> eel, 
(0) 


k=—oo 


while the Riemann-Lebesgue Lemma says 


lim c, =O. 
k- oo 


Corollary 3.16 is useful to compute sums of numerical series. 


Example 3.18 


The map f(x) = 2, defined on (—7,7) and prolonged by periodicity to all R 
(sawtooth wave) (Fig. 3.5), has a simple Fourier series. The map is odd, so 
az, = 0 for all k > 0, and b, = 7 (—1)*1, hence 


fr S- =(-1)+4 sinka. 
k=1 


The series converges in quadratic norm to f(x), and via Parseval’s identity we 


find 
| \t@Pac= noe 
7 k=1 
Since 
T rT? 
i a? dx = : 
/ 
—37 


Figure 3.5. Graphs of the sawtooth wave (left) and the corresponding polynomials 
Si,¢(x), S7,¢(x), S25,¢ (x) (right) 
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we have 


whence the sum of the inverses of all natural numbers squared is 
[o-e) 


Yeas 

727 Re? 

rae 6 

This fact was stated without proof in Example 1.11 i). 


We remark, at last, that additional assumptions of the regularity on f allow to 
estimate the remainder of the Fourier series in terms of n, and furnish informations 
on the speed at which the Fourier series tends to f in quadratic norm. For instance, 
if fe CG. is C” on R, r > 1, it can be proved that 


1 Tr 
If — Salle SUF lla. 


This is coherent with what will happen in Sect. 3.5.4. 


3.5.2 Pointwise convergence 


We saw that the Fourier series of f € Co, converges to f in quadratic norm, so in a 
suitable integral sense; this, though, does not warrant pointwise convergence. We 
are thus left with the hard task of finding conditions that ensure pointwise con- 
vergence: alas, not even assuming f continuous guarantees the Fourier series will 
converge. On the other hand the uniform convergence of the Fourier series implies 
pointwise convergence (see the remark after Definition 2.15). As the trigonometric 
polynomials are continuous, uniform convergence still requires f be continuous (by 
Theorem 2.17). We shall state, without proving them, some sufficient conditions 
for the pointwise convergence of the Fourier series of a non-necessarily continuous 
map. The first ones guarantee convergence on the entire interval [0,27]. But before 
that, we introduce a piece of notation. 


Definition 3.19 i) A function f is called piecewise regular on an interval 
[a, 6] C R in case 


a) it is differentiable everywhere on [a, b| except at a finite number of points 
at most; 
b) it is piecewise continuous, together with its derivative f’. 


ii) f is piecewise monotone if the interval |a, b] can be divided in a finite 
number of sub-intervals where f is monotone. 
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Theorem 3.20 Let f € Co, and suppose one of the following holds: 


a) f is piecewise regular on (0, 27]; 
b) f is piecewise monotone on (0, 27]. 


Then the Fourier series of f converges pointwise to f on (0, 27]. 


The theorem also holds under the assumption that f is not regularised; in such 
a case (like for the square wave or the sawtooth function), 


at a discontinuity point xo, the Fourier series converges to the regularised value 
flag) + f(%9) 
2 
of f, and not to f(xo). 


Now let us see a local condition for pointwise convergence. 


Definition 3.21 A  piecewise-continuous map admits left pseudo- 
derivative and right pseudo-derivative at x) € R if the following 
respective limits exist and are finite 


= f(x) p= aes 
f'(ao) = lam 1) ail 0) f'(at) = lia aa) 


LL XL — XO coat x — XO 


(Fig. 3.6 explains the geometric meaning of pseudo-derivatives.) If in addition f 
is continuous at xo, the pseudo-derivatives are nothing else than the left and right 
derivatives. 

Note that a piecewise-regular function on [0,27] admits pseudo-derivatives at 
each point of [0, 27]; despite this, there are functions that are not piecewise regular 
yet admit pseudo-derivatives everywhere in R (an example is f(x) = x? sin 1, for 
z #0, x € [—7,7 | and f(0) = 0). 


U4 
fat)t an 
f (xo) + 
fles)4 


Figure 3.6. Geometric meaning of the left and right pseudo-derivatives of f at xo 
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Theorem 3.22 Let f € Con. If at xo € (0, 2x] the left and right pseudo- 


derivatives exist, the Fourier series of f at xo converges to the (regularised) 
value f(x). 


Example 3.23 


Let us go back to the square wave of Example 3.11. Condition a) of Theorem 
3.20 holds, so we have pointwise convergence on R. Looking at Fig. 3.3 we can 
see a special behaviour around a discontinuity. If we take a neighbourhood of 
Xo = 0, the x-coordinates of points closest to the maximum and minimum points 
of the nth partial sum tend to x9 as n — oo, whereas the y-coordinates tend to 
different limits 2, ~ +1.18; the latter are not the limits f(0*) = +1 of f at 0. 
Such anomaly appears every time one considers a discontinuous map in Cae: and 
goes under the name of Gibbs phenomenon. 


3.5.3 Uniform convergence 


As already noted, there is no uniform convergence for the Fourier series of a dis- 
continuous function, and we know that continuity is not sufficient (it does not 
guarantee pointwise convergence either). 

Let us then introduce a new class of maps. 


Definition 3.24 A function f € Com is piecewise C! if it is continuous on 


R and piecewise regular on |0, 27]. 


The square wave is not piecewise C! (since not continuous), in contrast to the 
rectified wave. 
We now state the following important theorem. 


Theorem 3.25 Let f € Ca be piecewise C!. Its Fourier series converges 
uniformly to f everywhere on R. 


More generally, the following localization principle holds. 


Theorem 3.26 Let f € Con be piecewise regular on [0, 27]. Its Fourier series 


converges uniformly to f on any closed sub-interval where the map is continu- 
ous. 
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Example 3.27 
i) The rectified wave has uniformly convergent Fourier series on R (Fig. 3.4). 


ii) The Fourier series of the square wave converges uniformly to the function on 
every interval [e,7 —e] or [t+¢,27—e] (0<e< 7 £), because the square wave 
is piecewise regular on [0,27] and continuous on (0,7) and (a, 27). 


3.5.4 Decay of Fourier coefficients 


The equations of Sect. 3.4, relating the Fourier coefficients of a map and its deriv- 
atives, help to establish a link between the coefficients’ asymptotic behaviour, as 
|| —+ co, and the regularity of the function. For simplicity we consider complex 
coefficients. Let f € Co, be of class C” on R, with r > 1; from (3.22) we obtain, 
for any k £0, f 


= |Ves 
|| 


\cx| = 


The sequence |p| is bounded for |k| — 00; in fact, using (3.18) on f‘”) gives 
re, eva) 


Vk = (cike, eikr) } 


the inequality of Schwarz tells 


Folie Io 1 ae 
lye] < ek alle Jo. 


Thus we have proved 


is similarly found using (3.19) or a direct computation. In any case, if f has period 
27 and is C’ on R, its Fourier coefficients are infinitesimal of order at least r with 
respect to the test function 1/|k]. 

Vice versa, it can be proved that the speed of decay of Fourier coefficients 
determines, in a suitable sense, the function’s regularity. 


3.6 Periodic functions with period T > 0 


In case a function f belongs to Cr, ie., is defined on R, piecewise continuous on 
(0, T], regularised and periodic of period T > 0, its Fourier series assumes the form 
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es Dat aE 
f aot » (ax cos ku + by inka) ; 


k=1 


where 
1? 
w= | f(x) dz, 
Deas Qn 
== mus Sal 
ak 7 if f(x) cosk ade, geal 


De 2 
b= 5 | f(a) sink ade, Rooke: 


The theorems concerning quadratic, pointwise and uniform convergence of the 
Fourier series of a map in Co, transfer in the obvious manner to maps in Cr. 
Parseval’s formula reads 


ir 12 co 
/ F@)/? deo = Ta + = (ak +83). 


k=1 


As far as the Fourier series’ exponential form is concerned, (3.16) must be re- 
placed by 


(3.24) 
where 
Parseval’s identity takes the form 
© +00 
|) RePtea bP 
0 k=—00 
Example 3.28 
Let us write the Fourier expansion for f(z) = 1— 2? on IJ = [—1, 1], made 
periodic of period 2. As f is even, the coefficients b,; are zero. Moreover, 
Lf : 2 
ag = > f (-2*)de= [ (l—2*)dr=-, 
a 0 3 
1 1 
= =| (1 — x?) coskra dz = 2 | (1 — x?) coskra dz = (=1'"* 
2 —1 0 k2 72 


for any k > 1. 
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Hence 


2 42 (1 
pete Sy OF coskne, 


Since f is piecewise of class C' the convergence is uniform on R, hence also 
pointwise at every x € R. We can write 


2 4 A (-1)*! 


f@m= sts coskra, VxeER. 


3 on? <= ke 
In particular 
2. 2-3 


whence the sum of the generalised alternating harmonic series can be eventually 
computed 


3.7 Exercises 


1. Determine the minimum period of the following maps: 


a) f(x) =cos(3x — 1) b) f(@)= sin 5 — cos 4x 
c) f(x) =1+ cosa + sin3x d) f(x) =sinzcosz+5 
e) f(x) =1+ cos? x f) f(x) =|cosa|+sin 2x 
Sketch the graph of the functions on R that on (0,7) coincide with f(x) = /x 
and are: 
a) m-periodic; b) 27-periodic, even; c) 2n-periodic, odd. 


Given f(x) = cos? xz + sin3zx — 4, 
a) determine its minimum period; 
b) compute its Fourier series; 
c) study quadratic, pointwise, uniform convergence of such expansion. 


4. Determine the Fourier series expansion of the 27-periodic maps defined, on 
[—7,7], as follows: 


f(x) =1—2cosz + |z| b) f(#)=14+2+4+sin2z 


(ce) | f(x) = 4] sin® 2| [a] s@)={" if —tm<a <0, 


—cosx if 0<a<7 
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5. Determine the Fourier series of the regularised maps of period T = 1 below: 


Ll: ae —i,4 ; 
a) re) ={ = ) b) f@)= |sin2ra| 


x ifx € [0,5], 


7 sin 27x ifx € [0,5], 
) re) ={ l-a« ifze [4,1] 


0 ifxe [3,1] de 


Let f be the T-periodic, piecewise-continuous, regularised function whose 
Fourier series is 


Determine the period T and the symmetries (where present) of f. 


7. Appropriately using the Fourier series of f(x) = x? and g(x) = x, calculate 


the sum of: 
= il 
Qua by Doge 
k=1 k=1 
Determine the Fourier series of the 27-periodic map defined on |—7,7| by 
p(x)| + plx 
j(a) = Weal ee) 


where y(x) = x7 — 1. 


Determine the Fourier series for the 27-periodic, even, regularised function 
defined on [0,7] by 


F(x) roe lores 4 
t)= 

0 yy ae aT: 
Use the expansion to compute the sum of the series 


(oe) 


> ETI? sa 


k= a 


10. Find the Fourier coefficients for the 27-periodic, odd map f(x) = 1+sin2a” + 
sin 4x defined on (0, 7]. 
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Consider the 27-periodic map 


_ fcos2x if |x| < 
fe) = 4° if |x| > 


, 


NIA NIA 


and |x| <a 


defined on |—1,7]. Determine the Fourier series of f and f'; then study the 
uniform convergence of the two series obtained. 


Consider the 27-periodic function that coincides with f(x) = x? on (0, 2z7]. 


Verify its Fourier series is 


fr grt +4 (Ge eowke ~ sink). 


Study the convergence of the above and use the results to compute the sum of 
[o-e) 

—1)k 

the series S- iy) ; 
k=1 


k2 


Consider 


wl 
f(v)=2+5 pp sinks, ceR. 
k=1 


Check that f € C°°(R). Deduce the Fourier series of f’ and the values of || f||2 
27 


and Jide. 
0 


14. Consider the 27-periodic function defined on |—7,7) as f(x) = x. Determine 
the Fourier series of f and study its quadratic, pointwise and uniform conver- 
gence. 


3.7.1 Solutions 


1. Minimum period: 


ay PS or b) T=6r. c) T=2r. 
d) TH7. e) T="; i) f= 7. 


2. Graphs of maps: see Fig. 3.7. 


3. a) The minimum period is T = 27. 
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a) y 


y 
—27 —r ‘0 


hs 0 ; —. 


Figure 3.7. The graphs relative to Exercise 2 


b) Variously using trigonometric identities one gets 


f(x) = —4 + sin 3x + cos x cos” x 


1 
= —4+sin3z + —cosxz(1 + cos 2z) 
2 
1 1 
=-4+ 5 CO8e + sin 3a + 7 (cos 32 + cos x) 


3 1 
=—-4+ 7 cose + sin da + 7 008 8a. 


101 


c) As f is a trigonometric polynomial, the Fourier series is a finite sum of simple 


harmonics. Thus all types of convergence hold. 


4. Fourier series’ expansions: 


a) The function is the sum of the trigonometric polynomial 1 — 2cosa and the 
map g(x) = |z|. It is sufficient to determine the Fourier series for g. The latter 


is even, so by = O for all k > 1. Let us find the coefficients ax: 


se el iL oy” 1 
ee |x| dx - | zdz= >, 


=7- 


os 2 [" 
a, == | jolcoskrde = = | x cos ka dx 
T 7 Jo 


= 


2 r 2 

= =a lcoske + ka sin ka] = Sa —1) 
0 if k even, 

= ae Oe 


4 
a if k odd, 
1 
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In conclusion, 


mr 4en 1 
~ 2k+1 
ae 7 BEI thee 
so “ 
fritt—(2+=) cose =3 cos(2k + 1)x 
#2 Qk + 1) 
oo —1)kt1 
b) feit2sine+2> OO sinker 
k=3 


c) The map is even, so by = 0 for all k > 1. As for the ax: 


T 4 T 4 3 m 
ag = = | a|sin? | de = = | sin’ adx = — = : = cos. 
oT a TT 0 Tv 3 0 
iy ieee S [” 2% 
ee 4| sin’ z|coskadx = — | sin’ xcoska dx 
Tf 7 T JO 
a 
= - [ (3 sin — sin 3x) cos ka dx 
T JO 
2 x : 
—[sin* 2] ifk=1, 
6 [cos2e cos4z]" 1 >. 5. 4% 
le wee |, ~ ge bm 32] — 
~ 9 3 fcos(l+k)x | cos(l—k)x 
mil 2 1+k 1—k 
1 fcos(3—k)x  cos(3+k)x\]"_ 
= fee 1,3 
5 ( Je see i ee 
0 Lk=1,46=3, 
—1 i ik#1,3 
m1 — k2)(9 BB | pe es 
0 if k odd, 
= 96 
—— if k; ; 
TIO LE) if k even 
Overall, 
16 6 1 
_————— — ee i 2k . 
f a oo sin 2ha 


(1 — 4k2)(9 — 4k?) 
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d) Calculating the coefficients az, bp gives: 


1 Tv 


00 On 


b, = — 


2k — 1) 2(k +1) 


= m(k2 — 1) for k even 5 


= 

7 1 

= 7 eae 

i a La a cos(k + a eppere, 
T 

0 for k odd. 


Therefore 


5. Fourier series’ expansions: 


ye 9) sp ae \e 1 
~+-— cos 27 (2k — 1)a 
oa i = _ 


4 1 
b) jo AF gl quem 


Ld 2 1 
c) fe 7 + 5 Sin ene — oie 


Ll 2 1 
d) a ge a 
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6. Since 


1 2 
frl+ 57 sin2e + Toe sinde + 


we have T' = 7, which is the minimum period of the simple harmonic sin 22. 


The function is not symmetric, as sum of the even map g(x) = 1 and the odd 
Co 


k ; 
one h(a) = Dy Qk+13 sin 2kax. 
k=1 


7. Sum of series: 


a) We determine the Fourier coefficients for the 27-periodic map defined as f(x) = 
x? on [—7,7]. Being even, it has b, = 0 for all k > 1. Moreover, 


ir . 
a= = f ede =, 
7 0 3 


om i 2 2 2 / 
a= =f Pooskeds = = |Freoske + (— Fy) sink] 


T 


Therefore 


Parseval’s identity yields 
7 ) [o-2) 1 
ae OG 
j« af = 9” PEED 7a? 


so 


Sark 
k=1 
il 
») Spee. 
k=1 
8. Observe 


Ha) {ro if x € [—-7,-1] U[1, 7], 
t)= 

0 if « € (-1,1) 
(Fig. 3.8). The map is even so b; = 0 for all & > 1. Moreover 


T 2 
2 1 2 
= “des 
ee -[ @ 3 © oe 


y) Tv 
ay = - | (x? — 1) coska dx 
1 


T 
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—27 —T 0 T 20 


Figure 3.8. The graph of f relative to Exercise 8 


2. PA 8 sinka]” 
ie Zs (2k cos + (k°a* — 2) sin kz) — : 
4 k sin k 
=) i —cosk) , ee 1s 
then 
n 2 441 ,. , sink 
peg ge Dag ™ + —— — cosk) cos ka. 


9. For convenience let us draw the graph of f (Fig. 3.9). 
As f is even, the coefficients b, for k > 1 all vanish. The other ones are: 


a 3 
a == f (a —x)dz = <2, 
0 8 


T 
5. 7a sin kx 1 ; 
a, == | (1 —x)coskadx = 2 — —;(cos ka + ka sin ka) 
T Jo ba 1k 0 
ci \ 
T 
| | 2 | | 
| | | | 
: : Z : : 
| | | | 
l | l l — 
—T =< 0 5 7 x 


Figure 3.9. The graph of f relative to Exercise 9 


mw /2 
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‘ra (3 
= 7 sin Sk — 5 cos Sk + Ro IL 


Hence 
30 (1 2 2 
fant) (Fsin Fk Spcos Skt Fy ) cosh, 


The series converges pointwise to f, regularised; in particular, for 7 = 4 


Now, as 
sink {1 if k= 2 cose = {1 0" ifk=2m, 
2 (-1)" ifk=2m+1, 2 0 itk=2m+1, 
we have oY ee ; 
T 
go De Bam (OD “D-Day 
whence 
3 1 te 
— (2k + 1)? 8 
10. We have 
ag =0, Vk > 0; 
be = 4 = 1, bo, H Os Via 8: bomet = Seay: Wm > 0. 


11. The graph is shown in Fig. 3.10. Let us begin by finding the Fourier coefficients 
of f. The map is even, implying b; = 0 for all k > 1. What is more, 


Figure 3.10. The graph of f relative to Exercise 11 
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1 wT 1 a /2 7 1 
ay == f f(x) dx— | cos ard [ dx | =—--, 
T Jo T \ Jo np? 2 


9) T 9) n/2 
c= - | f(x) cos kx da— i cos 2rcos kr dr ~ [ cos kx da 
T JO Tv 0 w/2 


‘ am /2 7 
2 Lees _ ee ir R= D. 
Tv 2 8 0 2 n/2 
2[[sn(2—k)x  sin(QQ+k)z]7? [1 J” 
SF, || eee ee ye perc == sink fork #2 
|| SB BO Mile Ake ads ones 
1 
5 for =o, 
=¢.0 for = 2 n>, 
See 


nN a fork =2m+1,m>0. 
(2m + 1)(4— (2m +1)2) paren aoa 


Hence 


fr 2+ i cos? way uk (2k +1) 
SD OPES Le Ok + 1)(3 = 4k — 4k?) . 


Note the series converges uniformly on R by Weierstrass’ M-test: for k > 0, 


(-1)' 1 
2k + 1)x2) < ————___.__——__ = M 
(Ok +1 — 46 — 46) (O7* + Ye} S Oey ae aka) 
and » M,, converges (like the generalised harmonic series of exponent 3, because 
k=0 


My ~ sh as k — oo). In particular, the Fourier series converges for any x € R 
pointwise. Alternatively, one could invoke Theorem 3.20. 

Instead of computing directly the Fourier coefficients of f’ with the definition, 
we shall check if the convergence is uniform for the derivatives’ Fourier series; thus, 
we will be able to use Theorem 2.19. Actually one sees rather immediately f is 
C(R) with 


; mo si 2a iF la <2. 
f(x) = 
0 ifs <|2| <9, 


while f’ is piecewise C' on R (f” has a jump discontinuity at +3). Therefore the 
Fourier series of f’ converges uniformly (hence, pointwise) on R, and so 


/ ms : 8 . (=1)* : 
f (x) = —sin2z + ir » Ti? Lak 3g Sinlek + 1a, 
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12. We have 


a sec 4 
a= 5 f x “dz = 3°, 


1 21 

a, == | x? cos kx dx 
T JO 
ee fe ny ere k 2 fete) E>1 

= — |= sin GF cos — sin =— 

7k on es he oe mic 
1 20 

== f x? sin kx dx 
7 JO 
aif a y) y) a 4 
— —x? coskx + sprsinke + cos kx ele eal © 
a| k k2 k3 F k 


so the Fourier series of f is the given one. 

The function f is continuous and piecewise monotone on R, so its Fourier series 
converges to the regularised f pointwise (f(2k7) = 2x?, Vk € Z). Furthermore, 
the series converges to f uniformly on all closed sub-intervals not containing the 
points 2ka, Vk € Z. 

In particular, 


2_ 4 9 <1 49 — (—1)* 
iaj=7 = 37 +4) 7 75 coskm = 37 +450 72 
k=1 k=1 

whence : 

= (=1) ee us 

» em a" 37) =~ 
13. The series Ss" 5R sinkx converges to R uniformly because Weierstrass’ M-test 

k=1 


applies with M; = 3 this is due to 


I. 3 1 
\fr(x)| = 5g Sin ka = SR? Va eR. 
Analogous results hold for the series of derivatives: 
k; k; 
\fi.(x)| = 5K COS ke = oe? Vc ER, 
k?2 
fx (2)| = | az sin kx <y weR, 
n k” 
OW <a, VeeR 
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R uniformly; by 


k=0 


Consequently, for all n > 0 the series y F(a) converges on 
Theorem 2.19 the map f is differentiable infinitely many times: f € C(I). In 


particular, the Fourier series of f’ is 


f@=)>, 


k=1 


ie Va eR. 


5 coskz , 


NO 


To compute || f||2 we use Parseval’s formula 
IIB = f° [s(e)P de = 2xay +e (ah +88) =4n te 
k=1 k=1 
13 


us 
—1)=4 >~=s 
) slemar 3? 


itn ( 
= 47 + 7 T 
l-q 


from which ||f||2 = 54/487. At last, 


21 
(2) de = 27ag=4r. 
0 
14. We have 
= (=I 
f x2 S- —— = sin kx 


This converges quadratically; it converges pointwise to the regularised map coin- 
ciding with f for x £4 7+ 2kz and equal to 0 for x = 7 + 2k7z, k € Z; it converges 
uniformly on every closed interval not containing 7 + 2k7, k € Z. 


A 


Functions between Euclidean spaces 


This chapter sees the dawn of the study of multivariable and vector-valued func- 
tions, that is, maps between the Euclidean spaces R” and R™ or subsets thereof, 
with one of n and m bigger than 1. Subsequent chapters treat the relative differ- 
ential and integral calculus and constitute a large part of the course. 

To warm up we briefly recall the main notions related to vectors and matrices, 
which students should already be familiar with. Then we review the indispensable 
topological foundations of Euclidean spaces, especially neighbourhood systems of 
a point, open and closed sets, and the boundary of a set. We discuss the properties 
of subsets of R”, which naturally generalise those of real intervals, highlighting the 
features of this richer, higher-dimensional landscape. 

We then deal with the continuity features of functions and their limits; despite 
these extend the ones seen in dimension one, they require particular care, because 
of the subtleties and snags specific to the multivariable setting. 

At last, we start exploring a remarkable class of functions describing one- and 
two-dimensional geometrical objects — present in our everyday life — called curves 
and surfaces. The careful study of curves and surfaces will continue in the second 
part of Chapter 6, at which point the differential calculus apparatus will be avail- 
able. The aspects connected to integral calculus will be postponed to Chapter 9. 


4.1 Vectors in R” 


Recall R” is the vector space of ordered n-tuples 2 = (%;)j=1,....n, called vectors. 
The components x; of « may be written either horizontally, to give a row vector 


PS (Kiyecta tn) 
or vertically, producing a column vector 


Ly 


In 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_4, 
© Springer International Publishing Switzerland 2015 
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These two expressions will be equivalent in practice, except for some cases where 
writing vertically or horizontally will make a difference. For typesetting reasons 
the horizontal notation is preferable. 

Any vector of R” can be represented using the canonical basis {e1,...,en} 
whose vectors have components all zero except for one that equals 1 


1 ifi=j 


0 fit (4-1) 


Ee, = (Op jieyen where One = { 


Then 


2= s Lie; . (4.2) 
i=l 


The vectors of the canonical basis are usually denoted by 7, 7 in R? and i, 7, k in 
R°. It can be useful to identify a vector (%1,@2) =@1t+ Loj € R? with the vector 
(v1, 22,0) = v1 + raj + Ok € R?: the expression x17 + x27 will thus indicate a 
vector of R? or R°, according to the context. 

In R” the dot product of two vectors is defined as 


n 
ey = iyi t--.+ontn = > titi 
w=1 


This in turn defines the Euclidean norm 


|2| = vee = 


for which the Cauchy-Schwarz inequality 


ja -y| < lal Iyll (4.3) 
and the triangle inequality 
|e + yl] < lle] + Ilyll (4.4) 


hold. Two vectors x and y satisfying «-y = 0 are called orthogonal, and a 
vector x such that ||a|| = 1 is said a unit vector, or of length 1. The canonical 
basis of R” is an example of an orthonormal system of vectors, i.e., a set of n 
normalised and pairwise orthogonal vectors; its elements satisfy in fact 


e+e) = is LS 0) So ti 
From (4.2) we have 
G,=@:e; 


for the 7th component of a vector x. 
It makes sense to associate « € R” to the unique point P in Euclidean n- 
space whose coordinates in an orthogonal Cartesian frame are the components of 
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x; this fact extends what we already know for the plane and space. Under this 
identification ||a|| is the Euclidean distance between the point P, of coordinates 
x, and the origin O. The quantity |la — y|| = (a1 —y1)? +...+(@n — yn)? is 
the distance between the points P and Q of respective coordinates x# and y. 


In R? the cross or wedge product x A y of two vectors « = 211+ %2j + 23k 
and y = yii + yoj + y3k is the vector of R® defined by 


rAY= (xoy3 — L3Y2)t oP (r3y1 7 ©1Y3)J a (t1Y2 — r2y1)k. (4.5) 


It can also be computed by the formula 


z\y=det| 271 x2 23 (4.6) 
Yr Yo YB 


by expanding the determinant formally along the first row (see definition (4.11)). 
The cross product of two vectors is orthogonal to both (Fig. 4.1, left): 


(eAy) 2 =0: (aAy)-y=0. (4.7) 


The number ||x/\y|| is the area of the parallelogram with the vectors x, y as sides, 
while ||(a A y) - z|| represents the volume of the prism of sides x, y, z (Fig. 4.1). 
We have some properties: 


y\n=-(x%Ay), 
ezA\y=0 <= «=dy for some, €R, (4.8) 
(aty)Az=2Az+YAzZ, 


the first of which implies x A x = 0. 
Furthermore, 


these and previous ones prove for instance that if # = 714+ 2097, y = yit + Yog 
then x A Y= (x1Y2 = x2y1)k. 


xy 


' 


y 


<<< ae 


ob xv 


Figure 4.1. Wedge products x A y (left) and (a A y) - z (right) 
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A triple (v1, v2,v3) of pairwise non-aligned, unit vectors is said positively 
oriented (or right-handed) if 


(v1 A v2)- v3 >0, 


i.e., if vg and vy; A v2 lie on the same side of the plane spanned by v; and vg. A 
practical way to decide whether a triple is positively oriented is to use the right- 
hand rule: the pointing finger is v1, the long finger v2, the thumb v3. (The left 
hand works, too, as long as we take the long finger for v,, the pointing finger for 
v2 and the thumb for v3.) 

The triple (v1, v2, vs) is negatively oriented (or left-handed) if (vi, v2, —v3) 
is oriented positively 

(v1 A v2) + U3 < . 


4.2 Matrices 


A real matrix A with m rows and n columns (an m x n matrix) is a collection of 
m x n real numbers arranged in a table 


a11 9412 Gin 

a21 a22 a2n 
A= 

AGm1 Am2 +--+ Amn 


or, more concisely, 
—_ mn 
A = (aij)i<icm €R”. 
1<j<n 


The vectors formed by the entries of one row (resp. column) of A are the row 
vectors (column vectors) of A. Thus m x 1 matrices are vectors of R™ written as 
column vectors, while 1 x n matrices are vectors of R” seen as row vectors. When 
m =n the matrix is called square of order n. The set of m x n matrices is a 
vector space: usually indicated by R’™”, it is isomorphic to the Euclidean space 
R™”. The matrix C = AA+ uB, », w € R, has entries 


Cig = Adi + pi; , l<i<m,1l<j<n. 


If Aismxnand B is n x p, the product C = AB has by definition m rows and 
p columns; its generic entry is the dot product of a row vector of A with a column 
vector of B: 


In particular, if B = a is n x 1, hence a column vector in R”, the matrix product 
with the vector Az is well defined: it is a column vector in R™. If p =m, we have 
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square matrices AB of order m and BA of order n. If n = m, AB # BA in 
general, because the product of matrices is not commutative. 

The first minor A;; (1 <i <m,1< Jj <n) of an m xn matrix A is the 
(m —1) x (n — 1) matrix obtained erasing from A the ith row and jth column. 

The rank r < min(m,n) of A is the maximum number of linearly independent 
rows thought of as vectors of R” (or linearly independent columns, seen as vectors 
of R™). 

Given an m Xx n matrix A, its transpose is the n x m matrix A’ with entries 

=o, lsten, lage 

otherwise put, AZ is obtained from A by orderly swapping rows and columns; 
clearly, (A)? = A. Whenever defined, the product of matrices satisfies (AB)? = 
B" A’. The dot product of vectors in R” is a special case of matrix product 
x:-y=a2' y=y’' a, provided we think x and y as column vectors. 

In R™:” there is a norm, associated to the Euclidean norm, 


|| Al] = max{||Aa|]: 2 € R”, |x|] = 1}, (4.9) 
that satisfies the inequalities 
Aa] < ||Al||fe|| and || AB|| < || Al] |B | (4.10) 


for any « € R” and Be R””. 


Square matrices 

From now on we will consider square matrices of order n. Among them a par- 
ticularly important one is the identity matrix I = (6j;)1<i,j;<n, which satisfies 
AI = IA = A for any square matrix A of order n. A matrix A is called sym- 
metric if it coincides with its transpose 


Gay = Ge, 1l<ij7 <n; 


it is normal if AA’ = A? A, and in particular orthogonal if AA? = A’ A= TI. 

To each square matrix A one can associate a number det A, the determinant 
of A, which may be computed recursively using Laplace’s rule: det A = a ifn =1 
and A = (a), whereas for n > 1 


det A = So (-1)' Mai; det Aj; ; (4.11) 
j=l 
where i € {1,...,n} is arbitrary, but fixed. For instance 


det (: a — ad—be. 
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The following properties hold: 


det(AB) = det A det B, det(AA) = A” det A, 
det A? = det A, det A’ = —det A 


if A’ is A with two rows (or columns) exchanged. Immediate consequences are 
that det J = 1 and |det A| = 1 for A orthogonal. 

The matrix A is said non-singular if det A # 0. This is equivalent to the 
invertibility of A, i.e., the existence of a matrix A! of order n, called the inverse 
of A, such that 

AA'=A'A=I. 


If A and B are invertible, 
det(A7') = (det A)~", (AB)'!=B At, (A*)-'=(A"!); 


from the last equation it follows, as special case, that the inverse of a symmetric 
matrix is still symmetric. Every orthogonal matrix A is invertible, for A~! = A’. 
There are several equivalent conditions to invertibility, among which: 


the rank of A equals n; 
the linear system Ax = 6 has a solution a € R” for any b € R”; 
the homogeneous linear system Aw = 0 has only the trivial solution x = 0. 


The last one amounts to saying that 0 is no eigenvalue of A. 


Eigenvalues and eigenvectors 
The eigenvalues of A are the zeroes (in C) of the characteristic polynomial 
of degree n 

x(A) = det(A — AT). 


In other words, the eigenvalues are complex numbers A for which there exists a 
vector v # 0, called (right) eigenvector of A associated to \, such that 


Av = \v (4.12) 


(here and henceforth all vectors should be thought of as column vectors). The 
Fundamental Theorem of Algebra predicts the existence of p distinct eigenvalues 
AY, ...,A®) with 1 < p < n, each having algebraic multiplicity (as root of the 
polynomial y) uw, such that w +... +p) = n. The maximum number of 
linearly independent eigenvectors associated to \ is the geometric multiplicity 
of \; we denote it by m™, and observe m“ < yw. Eigenvectors associated to 
distinct eigenvalues are linearly independent. Therefore when the algebraic and 
geometric multiplicities of each eigenvalue coincide, there are in total n linearly 
independent eigenvectors, and thus a basis made of eigenvectors. In such a case 
the matrix is diagonalisable. 

The name means that the matrix can be made diagonal. To see this we number 
the eigenvalues by Az, 1 < k < n for convenience, repeating each one as many times 
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as its multiplicity; let vz be an eigenvector associated to Az, chosen so that the 
set {vx}i<k<n is made of linearly independent vectors. The equations 


Avy, = AKvk , LenS eh, 


can be rewritten in matrix form 


AP =PA, (4.13) 
where A = diag (A1,..., An) is the diagonal matrix with the eigenvalues as entries, 
and P = (vj,...,Un) is the square matrix of order n whose columns are the 


eigenvectors of A. The linear independence of the eigenvectors is equivalent to the 
invertibility of P, so that (4.13) becomes 


A=PAP', (4.14) 


and consequently A is similar to a diagonal matrix. 

Returning to the general setting, it is relevant to notice that as A is a real 
matrix (making the characteristic polynomial a real polynomial), its eigenvalues 
are either real or complex conjugate; the same is true for the corresponding eigen- 
vectors. The determinant of A coincides with the product of the eigenvalues 


det A = AyA2°-°:An—1An- 


The eigenvalues of A? = AA are the squares of those of A, with the same eigen- 
vectors, and the analogue fact will hold for the generic power of A. At the same 
time, if A is invertible, the eigenvalues of the inverse matrix are the inverses of 
the eigenvalues of A, while the eigenvectors stay the same; by assumption in fact, 
(4.12) is equivalent to 
a4 = 1 
v=AAv, SO A ES Ok 

The spectral radius of A is by definition the maximum modulus of the ei- 

genvalues 
A) = Agel 
p(A) = max. |x 


The (Euclidean) norm of A satisfies 


|All = (ATA); 


a special case is that of symmetric matrices A, for which ||A|] = p(A); if A is 
additionally orthogonal, then || A|| = \/p(Z) = 1. 


Symmetric matrices 

Symmetric matrices A have pivotal properties concerning eigenvalues and eigen- 
vectors. Each eigenvalue is real and its algebraic and geometric multiplicities coin- 
cide, rendering A always diagonalisable. The eigenvectors, all real, may be chosen 
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to form an orthonormal basis (in fact, eigenvectors relative to distinct eigenvalues 
are orthogonal, those with same eigenvalue can be made orthonormal); the matrix 
P associated to such a basis is thus orthogonal, so (4.14) reads 


A=PAP'. (4.15) 


Hence the transformation «+> P! ax = y defines an orthogonal change of basis in 
R”, from the canonical basis {e;}1<i<n to the basis of eigenvectors {vg }i<k<n; in 
this latter basis A is diagonal. The inverse transformation is yi» Py = a. 


Every real symmetric matrix A is associated to a quadratic form Q, which 
is a function Q : R” > R satisfying Q(Ax) = \2Q(ax) for any x € R”, A € R; to 
be precise, 


1 1 
Oe) = se" Ag = 5v Aa. (4.16) 


The eigenvalues of A determine the quadratic form Q: substituting to A the 
expression (4.15) and setting y = P’ a, we get 
1 1 
Q(a) = su PA Ps = 5 (Pix)! A(P* a) = xy Ay. 


Since Ay = (Anya )i<k<n if y = (YR)i<k<n, we conclude 


Q(a) == 50 AnyR- (4.17) 
k=1 


Nlre 


Consequently, one can classify A according to the sign of Q: 


e Ais positive definite if Q(x) > 0 for any z € R”, x F 0; equivalently, all 
eigenvalues of A are strictly positive. 

e A is positive semi-definite if Q(x) > 0 for any x € R”; equivalently, all 
eigenvalues of A are non-negative. 

e A is indefinite if Q assumes on R” both positive and negative values; this is 
to say A has positive and negative eigenvalues. 


The notion of negative-definite and negative semi-definite matrices are clear. 


Positive-definite symmetric matrices may be characterised in many other equi- 
valent ways. For example, all first minors of A (those obtained by erasing the same 
rows and columns) have positive determinant. In particular, the diagonal entries 
Qi; are positive. 

A crucial geometrical characterisation is the following: if A is positive definite 
and symmetric, the level sets 


{x €R” : Q(x) =c> 0} 


of Q are generalised ellipses (e.g., ellipses in dimension 2, ellipsoids in dimension 
3), with axes collinear to the eigenvectors of A. 
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Y2 X2 
Vi 


Figure 4.2. The conic associated to a positive-definite symmetric matrix 


In fact, (restricting to dimension two for simplicity) the equation 


1 2 2 
5 ryt + A2¥2) =C, Le., a 4 2 = 


(2c/A1) — (2e/Az) 


defines an ellipse in canonical form with semi-axes of length \/2c/A,; and \/2c/A2 

in the coordinates (y1,y2) associated to the eigenvectors v;, v2. In the original 

coordinates (21,22) the ellipse is rotated in the directions of v1, v2 (Fig. 4.2). 
Equation (4.17) implies easily 


de 
Q(x) 2 Fle’,  VYaeR", (4.18) 


where A, = min Ax. In fact, 
1<k<n 


rh 
| 
— 


rx ie 
Q(x) = a 2a 
k=l 


and |ly||? = ||P? al|? = 2? PP’ az = 27 2 = |||? as P is orthogonal. 


Example 4.1 


Take the symmetric matrix of order 2 


“() 


with a a real parameter. Solving the characteristic equation det(A — AI) = 
(4 — \)(2 — A) — a? = 0, we find the eigenvalues: 


Ay =3-V1l+a?, A2 =34+VJV1lt+a?>0. 


la] < 2/2 => >0 Ss. A positive definite , 


Then 


la| = 2/2 => \,=0 = A positive semi-definite , 


la} >2V2 => <0 = Aé indefinite. o 
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4.3 Sets in R” and their properties 


The study of limits and the continuity of functions of several variables require that 
we introduce some notions on vectors and subsets of R”. 
Using distances we define neighbourhoods in R”. 


Definition 4.2 Take x) € R” and let r > 0 be a real number. One calls 
neighbourhood of Xo of radius r the set 


B,(x#o) = {x € R”: |la — aol| < r} 


consisting of all points in R” with distance from ao smaller than r. The set 


B,(xo) = {a € R”: |la — agl| < r} 


is called closed neighbourhood of Xo of radius r. 


Therefore B,.(a) is the disc (n = 2) or the ball (n = 3) centred at xo of radius r, 
while B,(ao) is a disc or ball without boundary. 


If X is a subset of R”, by @X = R” \ X we denote the complement of X. 


Definition 4.3 A point x € R” is called 


i) an interior point of X if there is a neighbourhood B,(x) contained in 
DS 


ii) an exterior point of X if it belongs to the interior of @X; 
iii)a boundary point of X if it is neither interior nor exterior for X . 


Fig. 4.3 depicts the various possibilities. 

Boundary points can also be defined as the points whose every neighbourhood 
contains points of X and @X alike. It follows that X and its complement have 
the same boundary set. 


Definition 4.4 The set of interior points of X forms the interior of X, 
° 
denoted by X or int X. Similarly, boundary points form the boundary of 


X, written OX. Exterior points form the exterior of X. Eventually, the set 
X UOX is the closure of X, written X. 


To be absolutely accurate, given a topological space X the set OX defined above 
should be called ‘frontier’, because the term ‘boundary’ is used in a different sense 
for topological manifolds (defining the topology of which is a rather delicate mat- 
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“3 


Figure 4.3. An interior point 21, a boundary point x2, an exterior point a3 of a set X 


ter). Section 6.7.3 will discuss explicit examples where X is a surface. We shall 
not be this subtle and always speak about the boundary OX of X. 
From the definition, 


SCX CX. 


When one of the above inclusions is an equality the space X deserves one of the 
following names: 


° 
Definition 4.5 The set X is open if all its points are interior points, X = 


X. It is closed if it contains its boundary, X = X. 


A set is then open if and only if it contains a neighbourhood of each of its points, 
i.e., when it does not contain boundary points. Consequently, X is closed precisely 
if its complement is open, since a set and its complement share the boundary. 


Examples 4.6 
i) The sets R” and @ are simultaneously open and closed (and are the only such 
subsets of R”, by the way). Their boundary is empty. 


ii) The half-plane X = {(a, y) € R? : x > 1} is open. In fact, given (x0, yo) € X, 
any neighbourhood of radius r < zo — 1 is contained in X (Fig. 4.4, left). The 
boundary OX is the line x = 1 parallel to the y-axis (Fig. 4.4, right), hence the 
complementary set @X = {(x,y) € R? : x < 1} is closed. 


Any half-plane X C R? defined by an inequality like 
ax+by>c or az + by <c, 
with one of a, b non-zero, is open; inequalities like 
ax+by>c or ax+by<c 
define closed half-planes. In either case the boundary OX is the line ax + by = c. 
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Ya Ya 


Ox 


Figure 4.4. The boundary (right) of the half-plane x > 1 and the neighbourhood of a 
point (left) 


iii) Let X = B,(0) = {a € R”: ||a|| < 1} be the n-dimensional ball with centre 
the origin and radius 1, without boundary. It is open, because for any a € X, 
B,(ao) C X if we set r <1 — |lao||: for ||a — xo|| <r in fact, we have 


||] = |e — eo + aol < ||a — aol] + |leol] < 1 — ||oll + |lxoll = 1. 


The boundary 0X = {a € R”: |\a|| = 1} is the ‘surface’ of the ball (the 
n-dimensional sphere), while the closure of X is the ball itself, i.e., the closed 
neighbourhood B,(0). 

In general, for arbitrary x € R” and r > 0, the neighbourhood B,.(aq) is open, 


it has boundary {a € R” : ||a—ao|| =r}, and the closed neighbourhood B,.(xo) 
is the closure. 


iv) The set X = {a € R”: 2 < ||x—axo|| < 3} (an annulus for n = 2, a spherical 
shell for n = 3) is neither open nor closed, for its interior is 


X = {x €R”":2< ||x — xol| < 3} 


and the closure 


X = {x ER" :2<||a—20|| <3}. 


The boundary, 
OX ={x ER": |x — xo|| = 2} U {a € R”: |x — xo|| = 3}, 


is the union of the two spheres delimiting X. 


v) The set X = [0,1]? 9 @ of points with rational coordinates inside the unit 
square has empty interior and the whole square (0, 1]? as boundary. Since rational 
numbers are dense in R, any neighbourhood of a point in {0, 1]? contains infinitely 
many points of X and infinitely many of the complement. 
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Definition 4.7 A point x € R” is called a limit point of X if each of its 
neighbourhoods contains points of X different from a: 


Vie OF (Bey ar yen Xe ge 


A point « € X is isolated if there exists a neighbourhood B,(a) not contain- 
ing points of X other than a: 


fe 0), (Baya yee 0 


or equivalently, 
slp 0, Bek Ae. 


Interior points of X are certainly limit points in X, whereas no exterior point can 
be a limit point. A boundary point of X must necessarily be either a limit point 
of X (belonging to X or not) or an isolated point. Conversely, a limit point can 
be interior or a non-isolated boundary point, and an isolated point is forced to lie 
on the boundary. 


Example 4.8 


Consider X the set X = {(x,y) € R?: ve <1 or 2? +(y—1)? <0}. 


Requiring ve < 1, hence 4 < 1, defines the region lying between the lines 


y = +a (included) and containing the x-axis except the origin (due to the 
denominator). To this we have to add the point (0,1), the unique solution to 
x? +(y—1)? <0. See Fig. 4.5. 

The limit points of X are those satisfying y? < x?. They are either interior, when 
y? < x, or boundary points, when y = +2 (origin included). An additional 
boundary point is (0,1), also the only isolated point. 


Figure 4.5. The set X = {(a,y) € R’: ve < lor 2*+(y—1)? < 0} 


124 4 Functions between Euclidean spaces 
Let us define a few more notions that will be useful in the sequel. 


Definition 4.9 A set X is bounded if there exists a real number R > 0 


such that 
lel <R, vee X, 


i.e., if X is contained in a closed neighbourhood BR(O) of the origin. 


Definition 4.10 A set X is said compact if it is closed and bounded. 


Examples 4.11 


i) The unit square X = [0,1]? C R? is compact; it manifestly contains its bound- 
ary, and if we take x € X, 


||| = \/2? +03 < VIF 1 = V3. 
In general, the n-dimensional cube X = [0,1]" C R” is compact. 


ii) The elementary plane figures (such as rectangles, polygons, discs, ovals) and 
solids (tetrahedra, prisms, pyramids, cones and spheres) are all compact. 


iii) The closure of a bounded set is compact. 


Let a, 6 be distinct points of R”, and call S|a, b] the (closed) segment with 
end points a, b, i.e., the set of points on the line through a and b that lie between 
the two points: 

Sla,b] = {x#=a+t(b—a):0<t<1} 


(4.19) 
={e=(1—-that+tb:0<t<1}. 


Definition 4.12 A set X 1s called convex if the segment between any two 


points of X is all contained in X. 


Given r+1 points ao, ai1,...,@, in R”, all distinct (except possibly for ap = a,), 
one calls polygonal path of vertices ao, a,,...,a, the union of the r segments 
Sla;-1, ai], 1 <i <r, joint at the end points: 

Tr 


Pl Qiae5 Ge] = UJ eller ae eee cle 
i=1 


Definition 4.13 An open set A C R” is (path-)connected if, given two 


arbitrary points x, y in A, there is a polygonal path joining them that is 
entirely contained in A. 


Figure 4.6 shows an example. 
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Figure 4.6. A connected (but not convex) set A 


Examples 4.14 


i) The only open connected subsets of R are the open intervals. 


ii) An open and convex set A in R” is obviously connected. 


iii) Any annulus A = {a € 


R22 r1 < ||z—ao|| < re}, with a € R?, r2 > 171 > 0, 


is connected but not convex (Fig. 4.7, left). Finding a polygonal path between 
any two points x, y € A is intuitively quite easy. 


iv) The open set A = {a = (x,y) € R?: xy > 1} is not connected (Fig. 4.7, 


right). 


0 


It is a basic fact that an open non-empty set A of R” is the union of a family 
{Ai}iex of non-empty, connected and pairwise-disjoint open sets: 


A=|(JAi 


1EL 


with A;NA;=@forif#). 


Each A; is called a connected component of A. 


zy=1 y 


Figure 4.7. The set A of Examples 4.14 iii) (left) and iv) (right) 
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Every open connected set has only one connected component, namely itself. 
The set A of Example 4.14 iv) has two connected components 


A, = {x =(a#,y)€A:2>0} and Ag={x=(2,y)€A:2<O}. 


Definition 4.15 We call region any subset R of R" made by the union of 
a non-empty, open, connected set A and part of the boundary 0A 


Ro ALS with 0CZCOA. 


When Z = @) the region is open, when Z = OA it is closed. 
A region may be defined equivalently as a non-empty set R of R” whose interior 
° 


A =R is connected and such that A C R C A. 


Example 4.16 
The set R = {a € R?: \/4— 2? — y? < 1} is a region in the plane, for 
R= {a € R?: V3 < |la|| <2}. 


Therefore A = R is the open annulus {x € R? : V3 < ||x|| < 2}, while Z = {x € 
R?: ||x|| = 2} c OA. 


4.4 Functions: definitions and first examples 


We begin discussing real-valued maps, sometimes called (real) scalar functions. 
Let n be a given integer > 1. A function f defined on R” with values in R is a 
real function of n real variables; if dom f denotes its domain, we write 


f:domf CR” >R. 


The graph I'(f) = {(a, f(a)) € R"t! : w € dom f} is a subset of R"*1. 

The case n = 1 (one real variable) was dealt with in Vol. I exhaustively, so in 
the sequel we will consider scalar functions of two or more variables. If n = 2 or 3 
the variable x will also be written as (x,y) or (x,y, z), respectively. 


Examples 4.17 


i) The map z = f(z, y) = 2x — 3y, defined on R?, has the plane 2x — 3y —z =0 
as graph. 

eae 
ye — we 


iii) The function w = f(x,y, z) = V/z — x2? — y? has domain dom f = {(z, y, z) € 
R? : z > x2? + y}, which is made of the elliptic paraboloid z = x? + y? and the 
region inside of it. 


ii) The function z = f(x,y) = is defined on R? without the lines y = +2. 
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iv) The function f(x) = f(11,...,¢%n) = log(1 — x? —... — x2) is defined inside 


nr 
the n-dimensional sphere #?+...+22 =1,sincedom f= {a ER": 2? <1). 
1 n a 
i=1 O 


In principle, a scalar function can be drawn only when n < 2. For example, 
f(x,y) = V9 — x2 — y? has graph in R® given by z = \/9 — x2 — y?. Squaring the 
equation yields 


A hence e+ y?+227=9. 


We recognise the equation of a sphere with centre the origin and radius 3. As 
z > 0, the graph of f is the hemisphere of Fig. 4.8. 


z=9-a“7-y 


Another way to visualise a function’s behaviour, in two or three variables, is 
by finding its level sets. Given a real number c, the level set 


Ky ote edom fj -f(r)— cy (4.20) 


is the subset of R” where the function is constant, equal to c. Figure 4.9 shows 
some level sets for the function z = f(x,y) of Example 4.9 ii). 

Geometrically, in dimension n = 2, a level set is the projection on the xy-plane 
of the intersection between the graph of f and the plane z = c. Clearly, L(f,c) is 
not empty if and only if c € im f. A level set may have an extremely complicated 
shape. That said, we shall see in Sect. 7.2 certain assumptions on f that guarantee 
L(f,c) consists of curves (dimension two) or surfaces (dimension three). 


Consider now the more general situation of a map between Euclidean spaces, 
and precisely: given integers n,m > 1, we denote by f an arbitrary function on 
R” with values in R™ 


f : dom f CR” —R”. 


Figure 4.8. The function f(x,y) = \/9 — x? — y? 
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y 
A 


x 
2 2 

Figure 4.9. Level sets for z = f(x,y) = v + - 
y2— 2x 


If m = 1, we have the scalar functions of above. If m > 2 we shall say f is a real 
vector-valued function. 

Let us see some interesting cases. Curves, seen in Vol. I for m = 2 (plane 
curves) and m = 3 (curves in space), are special vector-valued maps where n = 1; 
surfaces in space are vector-valued functions with n = 2 and m = 3. The study of 
curves and surfaces is postponed to Sects. 4.6, 4.7 and Chapter 6 in particular. In 
the case n = m, f is called a vector field. An example with n = m = 3 is the 
Earth’s gravitational field. 

Let fi, 1 <7 <m, be the components of f with respect to the canonical basis 
1e:}iziem of R™: 


Each f; is a real scalar function of one or more real variables, defined on dom f 
at least; actually, dom f is the intersection of the domains of the components of f. 


Examples 4.18 
i) Consider the vector field on R? 


The best way to visualise a two-dimensional field is to draw the vector cor- 
responding to f(x,y) as position vector at the point (x,y). This is clearly not 
possible everywhere on the plane, but a sufficient number of points might still 
give a reasonable idea of the behaviour of f. Since f(1,0) = (0,1), we draw 
the vector (0,1) at the point (1,0); similarly, we plot the vector (—1,0) at (0,1) 
because f(0,1) = (—1,0) (see Fig. 4.10, left). 

Notice that each vector is tangent to a circle centred at the origin. In fact, the 
dot product of the position vector « = (x,y) with f(a) is zero: 


x- f(x) =(2,y)-(-y,2) = -rzy+ ary =0, 
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Figure 4.10. The fields f(x,y) = (—y, x) (left) and f(x,y, z) = (0,0, z) (right) 


making x and f(a) orthogonal vectors. Additionally, ||f(a)|| = ||a||, so the 
length of f(x, y) coincides with the circle’s radius. 
This vector field represents the velocity of a wheel spinning counter-clockwise. 


ii) Vector fields in R® can be understood in a similar way. Figure 4.10, right, 
shows a picture of the vector field 

f(«,y, 2) = (0,0,z). 
All vectors are vertical, and point upwards if they lie above the xy-plane, down- 
wards if below the plane z = 0. The magnitude increases as we move away from 
the ry-plane. 


iii) Imagine a fluid running through a pipe with velocity f(z, y,z) at the point 
(x,y,z). The function f assignes a vector to each pont (x,y,z) in a certain 
domain 2 (the region inside the pipe) and so is a vector field of R®, called the 
velocity vector field. A concrete example is in Fig. 4.11. 


Figure 4.11. The velocity vector field of a fluid moving in a pipe 
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iv) The R?-valued map 

2 y Zz x 
fon) = (aaa aRe GT aa) * Ta 
on R3\ {0} represents the electrostatic force field generated by a charged particle 
placed in the origin. 


v) Let A = (a;;) 1 
1 


m be areal m x n matrix. The function 
nm 


f(x) = Aa, 


is a linear map from R” to R™. 


4.5 Continuity and limits 


The notion of continuity for functions between Euclidean spaces is essentially the 
same as what we have seen for one-variable functions (Vol. I, Ch. 3), with the 
proviso that the absolute value of IR must be replaced by an arbitrary norm in R” 
or R™ (which we shall indicate with || - || for simplicity). 


Definition 4.19 A function f : dom f C R” > R” is said continuous at 
zo € dom f if for any € > 0 there exists a 6 > 0 such that 


Vaedomf, = |jw—aol]l<d =>  ||fF(x) — fleo)ll <e; 


that is to say, 
Ve € dom f, £e Bsa) = f(x) e Be( fiero): 


A map f is continuous on a set 2 C dom f if it is continuous at each point 
ae (2. 


The following result is used a lot to study the continuity of vector-valued func- 
tions. Its proof is left to the reader. 


Proposition 4.20 The map f = (fi)i<i<m is continuous at ao € dom f if 


and only if all its components f; are continuous. 


Due to this result we shall merely provide some examples of scalar functions. 


Examples 4.21 
i) Let us verify f : R? > R, f(x) = 221+ 522 is continuous at x = (3,1). Using 
the fact that |y;| < ||y|| for all 7 (mentioned earlier), we have 
|f(x) — f(@o)| = |2@1 — 3) + 5(a2 — 1] < 2lar — 3] + 5]x2 — 1] < 7a — aol]. 
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Given € > 0 then, it is enough to choose 6 = €/7 to conclude. The same argument 
shows f is continuous at each a € R?. 


ii) The above function is an affine map f : R” > R, ie., a map of type f(x) = 
a-x+b(aéR", b€R). Affine maps are continuous at each a € R” because 


|f(x) — f(@o)| = |a- a —a-ao| = |a-(% — x0)| < |lal| ]a— aol] (4.21) 
by the Cauchy-Schwarz inequality (4.3). 
If a = 0, the result is trivial. If a 4 0, continuity holds if one chooses 6 = ¢/|la| 
for any given € > 0. 


The map f is uniformly continuous on (2 if we may choose 6 independently 
of 2 in the above definition; this is made precise as follows. 


Definition 4.22 A function is said uniformly continuous on 92 C dom f 
if for any « > 0 there exists a 6 > 0 such that 


Va’, 2” € QQ, lla ae 6 ie fale. (4.22) 


For instance, the above affine function f is uniformly continuous on R”. 


A continuous function on a closed and bounded set (i.e., a compact set) (2 is 
uniformly continuous therein (Theorem of Heine-Cantor, given in Appendix A.1.3, 
p: 515). 


Often one can study the continuity of a function of several variables without 
turning to the definition. To this end the next three criteria are rather practical. 


i) If the map is defined and continuous on a set I CR, then 
f(x) = (21) 


is defined and continuous on Q =I x R"-1 CR”. 
In general, any continuous map of m variables is continuous if we think of it 
as a map of n > ™m variables. 


For example, the following functions are continuous: 
R2 
on dom fo = R x [-1,1] x R. 


fila, y) = e* on dom fi 
fo(a, y,2) = La? 


ii) If f and g are continuous on 2 C R", then also f +g, f —g and fg are 


continuous on 2, while f/g is continuous on the subset of Q where g £0. 


Examples of continuous maps: 


hi(z,y)=e7+siny ondomh; =R?, 
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ho(z,y) = ylogx on dom hz = (0,+00) x R, 


h3(x,y,2z) = me on dom h3 = R® \ ({0} x R x {0}). 


iii) If f is continuous on 2 CR” and g is continuous on I CR, the composite 


map go f is continuous on domgo f ={xeE 2: f(x) € I}. 


For instance, 

hi(x, y) = log(1 + zy) on dom h, = {(z,y) € R?: zy > —1}, 

x + y? 
z—1 


ha(z,y,2) = on dom hz = R? x (R \ {1}), 


h3(@) = ~/at+...+24 ondomh3=R”. 


In particular, Proposition 4.20 and criterion i) imply the next result, which 
is about the continuity of a composite map in the most general setting. 


Proposition 4.23 Let 
f :domf C R” > R”, g:domg C R™ > R? 


be functions and x € dom f a point such that yo = f(xo) € domg. Consider 
the composite map 


h=gof:domhCR” >R’, 


where x) € domh. Then if f is continuous at xo and g is continuous at yo, 
h is continuous at xo. 


The definition of finite limit of a vector-valued map, for x > a € R”, is 
completely analogous to the one-variable case. From now on we will suppose f is 
defined on dom f C R” and ao € R” is a limit point of dom f. 


Definition 4.24 One says that f has limit @ € R™” (or tends to @) as x 
tends to Zo, in symbols 

lim f 

L>LO 


if for any « > 0 there is a 6 > 0 such that 


Va € dom f, 0<||jx—-ao||<6 = 


Ve € dom f, x € B;(ao) \ {xo} 
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As for one real variable, if f is defined on ao, then 


The analogue of Proposition 4.20 holds for limits, justifying the component-by- 
component approach. 


Examples 4.25 


i) The map 
41,4 
_ ery 
f(x,y) _ x2 + y? 
is defined on R? \ {(0,0)} and continuous on its domain, as is clear by using 
criteria 7) and 7) on p. 131. What is more, 


l f(z,y) =9, 


im 
(x,y) (0,0) 
because 
at ty! < at 4 2a?y? 4 yt = (2? 4 y?)? 
and by definition of limit 
at + y4 
x? + y? 
therefore the condition is true with 6 = Ve. 


<ety<e if O<|lall< ve; 


ii) Let us check that 
ja] + [yl 
lim  ——>= 


(w,y)>(0,0) 22 + y? ie 


Since 

ty? <a + Qla||y| ty? = (lel + ll)’, 
so that ||a|| < |x| + |y|, we have 

jz] +|yl _ lel+lyl 1 1 
ff4)= 6 - = a ee ee 
|x|? ll] lal] ~ dal] 

and f(z,y) > A if ||x|| < 1/A; the condition for the limit is true by taking 
6=1/A. 


A necessary condition for the limit of f(a) to exist (finite or not) as x — ao, is 
that the restriction of f to any line through ao has the same limit. This observation 
is often used to show that a certain limit does not exist. 


Example 4.26 
The map 
r+ y? 
f(x,y) = oy? 


does not admit limit for (x,y) — (0,0). Suppose the contrary, and let 
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L= im a 
(x,y) (0,0) Hey) 


be finite or infinite. Necessarily then, 
f= lim f(z, 0) = lim f (0,4). 
x—0 y—0 
But f(x,0) = x, so lim f(x,0) = 0, and f(0,y) = 1 for any y # 0, whence 
=> 
lim f(0,y) =1. 
yO 


One should not be led to believe, though, that the behaviour of the function 
restricted to lines through 20 is sufficient to compute the limit. Lines represent 
but one way in which we can approach the point xo. Different relationships among 
the variables may give rise to completely different behaviours of the limit. 


Example 4.27 
Let us determine the limit of 
zet/¥ ify #0, 


x, = . 
f(x,y) { . ify =0. 
at the origin. The map tends to 0 along each straight line passing through (0, 0): 


lim f(z,0) = lim f(0,y) = 0 
x—0 y—0 
because the function is identically zero on the coordinate axes, and along the 
other lines y = kx, k £0, 
lim fi tka) lim fe 0 
Yet along the parabolic arc y = 2”, x > 0, the map tends to infinity, for 
lim f(a,27) = lim zel/* = +00. 
x2—0t xz—0r 
Therefore the function does not admit limit as (x,y) — (0,0). 


A function of several variables can be proved to admit limit for x tending to xo 
(see Remark 4.35 for more details) if and only if the limit behaviour is independent 
of the path through ao chosen (see Fig. 4.12 for some examples). The previous 
cases show that if that is not true, the limit does not exist. 


LY 0 


Figure 4.12. Different ways of approaching the point ao 
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For two variables, a useful sufficient condition to study the limit’s existence 
relies on polar coordinates 


L=xXo+rcoss, y=yotrsindg. 


Proposition 4.28 Suppose there exist €€ R and a map g depending on the 
variable r such that, on a neighbourhood of (x0, yo), 


|f(zo + rcosé,yo +rsind) — £| < g(r) with lim g(r) = 0), 


r>0t 


Then 


lim Ge) ee 


(x,y) (wo,4o) 


Examples 4.29 
i) The above result simplifies the study of the limit of Example 4.25 i): 
4: 49 4 46 
f(rcos6,rsiné) = eee = r*(sin* @ + cos* @) 
r 
< r*(sin? 6 + cos? 0) =r?, 
recalling | sin@| < 1 and |cos6| < 1. Thus 
| f(r cos 0, rsin6)| < r? 


and the criterion applies with g(r) = r?. 


ii) Consider 
x logy 


in —=——. 
(2,y)>(0,1) \/a? + (y — 1)? 


r cos @ log(1 + rsin 6) 


Then 


f(rcos0#,1+rsin@) = = cos @log(1+rsin6). 


. 
Remembering that 
log(1+t 
lim ee) aun = 1. 
t30 t 
for t small enough we have 


log(1 + t) 

— 

ie., |log(1 + t)| < 2|t|. Then 
|f(rcos6,1+rsin@)| = |cos@log(1+rsin6)| < 2r|sin@cos6| < 2r. 

The criterion can be used with g(r) = 2r, to the effect that 

x logy 


tit, 
(x,y) 4(0,1) /a? + (y — 1)? 
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We would also like to understand what happens when the norm of the inde- 
pendent variable tends to infinity. As there is no natural ordering on R” for n > 2, 
one cannot discern, in general, how the argument of the map moves away from 
the origin. In contrast to dimension one, where it is possible to distinguish the 
limits for z + +oo and x — —ov, in higher dimensions there is only one “point” 
at infinity oo. Neighbourhoods of the point at infinity are by definition 


Br(oo) ={aeR":||e||>R} with R>O. 


Each Br(co) is the complement of the closed neighbourhood BR(0) of radius R 
centred at the origin. 

With this, the definition of limit (finite or infinite) assumes the usual form. 
For example a function f with unbounded domain in R” has limit @€ R” as x 
tends to ox, written 


if for any ¢ > 0 there is an R > 0 such that 
Vaedomf, |la|>R => |f(x)-4€<e, 
Lé., 


Va € dom f, zeEBr(o) => f(x) € B(2£). 


Eventually, we discuss infinite limits. A scalar function f has limit +00 (or 
tends to +00) as x tends to 20, written 


lim f(x) = +00, 
L—->LO 


if for any R > 0 there is a 6 > 0 such that 
Va € dom f, O<|le—a||<6 => fla)>R, (4.24) 
1.e., 


Va € dom f, x € Bs(ao)\{ao} = f(a) € Br(+oco). 
The definitions of 


lim f(x) = —oo and lim f(x) = —oo 
L>2XO F eo-e) 


descend from the previous ones, substituting f(a) > R with f(a) < —R. 


In the vectorial case, the limit is infinite when at least one component of f 
tends to oo. Here as well we cannot distinguish how f grows. Precisely, f has 
limit oo (or tends to oo) as x tends to zo, which one indicates by 


pA) ce: 
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if 
lim || f(«)|| = +00. 
L>LO 


Similarly we define 
lim f(x) =oo. 


@L—-Co 


4.5.1 Properties of limits and continuity 


The main theorems on limits of one-variable maps, discussed in Vol. 1, Ch. 4, 
carry over to several variables. Precisely, we still have uniqueness of the limit, 
local invariance of tha function’s sign, the various Comparison Theorems plus 
corollaries, the algebra of limits with the relative indeterminate forms. Clearly, 
az — c should be understood by thinking c as either a point a € R” or oo. 


We can also use the Landau formalism for a local study in several variables. 
The definition and properties of the symbols O, 0, =, ~ depend on limits, and 
thus extend. The expression f(x) = o(||a]||) as a — 0, for instance, means 


lim fz) =), 
20 ||a|| 
An example satisfying this property is f(a, y) = 2x? — 5y°. 


Some continuity theorems of global nature, see Vol. I, Sect. 4.3, have a counter- 
part for scalar maps of several variables. For example, the theorem on the existence 
of zeroes goes as follows. 


Theorem 4.30 Let f be a continuous map on a region R CR”. If f assumes 


on R both positive and negative values, it necessarily has a zero on R. 


Proof. Ifa,b € R satisfy f(a) < 0 and f(b) > 0, and if Pla,..., 6] is an arbitrary 
polygonal path in R joining a and b, the map f restricted to Pla,..., b] is 
a function of one variable that satisfies the ordinary Theorem of Existence 
of Zeroes. Therefore an a € Pla,...,b] exists with f(a) = 0. Oo 


From this follows, as in the one-dimensional case, the Mean Value Theorem. 
We also have Weierstrass’s Theorem, which we will see in Sect. 5.6, Theorem 5.24. 


For maps valued in R™, m > 1, we have the results on limits that make sense 
for vectorial quantities (e.g., the uniqueness of the limit and the limit of a sum of 
functions, but not the Comparison Theorems). 


The Substitution Theorem holds, just as in dimension one (Vol. I, Thm. 4.15); 
for continuous maps this guarantees the continuity of the composite map, as men- 
tioned in Proposition 4.23. 
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Examples 4.31 
i) We prove that 


lim eal —1. 
(x,y)—00 
As (2,y) — oo, we have |x| + |y| ~ +00, hence t = aaa — 0; but the 


exponential map t > e! is continuous at the origin, so the result follows. 


ii) Let us show 


ge y* 
i: == +00. 
(z,y)700 Z* +y 
This descends from 
ci+y* 1 2 2 


and the Comparison Theorem, for (x, y) + oo is clearly the same as ||a|| — ++0o. 
To get (4.25), note that if x? > y?, we have 
lal]? = 2? +y? < 22°, 
so 
AD Np 
x2+y2 ~ Qn? 2 
at the same result we arrive if y? > 2°. 


4.6 Curves in R™ 


A curve can describe the way the boundary of a plane region encloses the region 
itself — think of a polygon or an ellipse, or the trajectory of a point-particle mov- 
ing in time under the effect of a force. Chapter 9 will provide us with a means 
of integrating along a curve, hence allowing us to formulate mathematically the 
physical notion of the work of a force. 


Let us start with the definition of curve. Given a real interval J and a map 


= S- ri(t)e; € R™ the image of 
i=1 


+ : I — R™, we denote by y(t) = (Tilt) 1 ciccem 


t € I under +. 


Definition 4.32 A continuous function y :I CR — R™ is called a curve. 


The set I = (I) C R™ is said trace of the curve. 


If the trace of the curve lies on a plane, one speaks about a plane curve. 
The most common curves are those in the plane (m = 2) 
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We wish to stress the difference occurring between a curve, which is a function 
of one real variable, and its trace, a set in Euclidean space R™. A curve provides 
a way to parametrise its trace by associating to the parameter t € J one (and only 
one) point of the trace. Still, 7 may be the trace of many curves, because it may be 
parametrised in distinct ways. For instance the plane curves y(t) = (t,t), t € [0,1], 
and 6(t) = (t?,t?), t € [0,1] have the segment AB of end points A = (0,0) and 
B = (1,1) as common trace; y and 6 are two parametrisations of the segment. 


v2 


The mid-point of AB, for example, corresponds to the value t = 4 for y,t=% 


for 0. 


A curve ¥ is said simple if 7 is a one-to-one map, i.e., if distinct parameter’s 
values determine distinct points of the trace. 

If the interval J = [a,b] is closed and bounded, as in the previous examples, the 
curve 7 will be named arc. We call end points of the arc the points Pp = (a), 
P, = (0); precisely, Po is the initial point and P, the end point, and +¥ joins 
Po and P,. An arc is closed if its end points coincide: y(a) = y(b); although 
a closed arc cannot be simple, one speaks anyway of a simple closed arc (or 
Jordan arc) when the point y(a) = y(b) is the unique point on the trace where 
injectivity fails. Figure 4.13 illustrates a few arcs; in particular, the trace of the 
Jordan arc (bottom left) shows a core property of Jordan arcs, known as Jordan 
Curve Theorem. 


Co 
y(a 
(a) = ¥(0) (a) = ¥(0) 


Figure 4.13. The trace I’ = +({a,b]) of a simple arc (top left), a non-simple arc (top 
right), a simple closed arc or Jordan arc (bottom left) and a closed non-simple arc (bottom 
right) 


) (a) 
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Theorem 4.33 The trace I’ of a Jordan arc in the plane divides the plane 
in two connected components 3); and 7). with common boundary I’, where >); 


is bounded, '. 1s unbounded. 
Conventionally, 33; is called the region “inside” I’, while ©). is the region 
“outside” I’. 


Like for curves, there is a structural difference between an arc and its trace. 
It must be said that very often one uses the term ‘arc’ for a subset of Euclidean 
space R™ (as in: ‘circular arc’), understanding the object as implicitly parametrised 
somehow, usually in the simplest and most natural way. 


Examples 4.34 
i) The simple plane curve 
y(t) = (at +b,ct+d), teR, a40 
ad — be 


as trace. Indeed, putting = x(t) = at + b and 


a 
—b 
y = y(t) =ct+d givest = al 


has the line y = a + 
a 


, SO 


Come Ne re ee ace 
a a a 


ii) Let py: J > R be a continuous function on the interval J; the curve 
v(t) = tt + v(t), tel 
has the graph of as trace. 
iii) The trace of 
¥(t) = (x(t), y(t)) = (1 + cost, 3 + sint), t € [0, 27] 
is the circle with centre (1,3) and radius 1, for in fact (a(t) — 1)° + (y(t) - 


3)" = cos?t + sin?¢ = 1. The arc is simple and closed, and is the standard 
parametrisation of the circle starting at the point (2,3) and running counter- 
clockwise. 


More generally, the closed, simple arc 
¥(t) = (x(t), y(t)) = (xo +rcost,yotrsint),  t € [0,27] 
has trace the circle centred at (xo, yo) with radius r. 


If t varies in an interval [0,2k7], with k > 2 a positive integer, the curve has the 
same trace seen as a set; but because we wind around the centre k times, the 
curve is not simple. 


Instead, if ¢ varies in [0, 7], the curve is a circular arc, simple but not closed. 
iv) Given a,b > 0, the map 
¥(t) = (x(t), y(t)) = (acost, bsint), t € (0, 27] 
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is a simple closed curve parametrising the ellipse with centre in the origin and 
semi-axes a and b: 


2 2 
ee 
a b? 
v) The trace of 
¥(t) = (x(t), y(t)) = (tcost,tsint) = tcosti+tsintj, t € [0, +00] 


is a spiral coiling counter-clockwise around the origin, see Fig. 4.14, left. Since the 
point y(t) has distance ,/x?(t) + y?(t) = t from the origin, so it moves farther 
afield as t grows, the spiral a simple curve. 


vi) The simple curve 

+(t) = (x(t), yt), z(2) = (cost, sint, t) = costi+sintj+tk, teR 
has the circular helix of Fig. 4.14, right, as trace. It rests on the infinite cylinder 
{(z,y,z) € R3: 27 +y? =1} along the z-axis with radius 1. 


vii) Let P and Q be distinct points of R™, with m > 2 arbitrary. The trace of 
the simple curve 


y(t) =P+(Q-P)t, teER 


is the line through P and Q, because 7(0) = P, y(1) = Q and the vector y(t)— P 
has constant direction, being parallel to Q — P. 


There is a more general parametrisation of the same line 


y(t) =P+(Q- Pe , teéR, (4.26) 


with to A t1; in this case y(to) = P, y(ti) = Q. 


y z 
A 
x 
x 
y 


Figure 4.14. Spiral and circular helix, see Examples 4.34 v) and vi) 
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Some of the above curves have a special trace, given as the locus of points in 
the plane satisfying an equation of type 


f(a,y) =¢; (4.27) 


otherwise said, they parametrise level sets of f. With suitable assumptions on f, 
one can start from an implicit equation like (4.27) and obtain a curve y(t) = 
(x(t), y(t)) of solutions. The details may be found in Sect. 7.2. 


Remark 4.35 We posited in Sect. 4.5 that the existence of the limit of f, as x 
tends to a € R”, is equivalent to the fact that the restrictions of f to the trace 
of any curve through xp admit the same limit; namely, one can prove that 


Ino ag) te ibnan N) e 


LO Far 10) 


for any curve y: I + dom f such that (to) = ao for some tg € I. Oo 


Curves in polar, cylindrical and spherical coordinates 
Representing a curve using Cartesian coordinates, as we have done so far, is but 
one possibility. Sometimes it may be better to use polar coordinates in dimension 
2, and cylindrical or spherical coordinates in dimension 3 (these were introduced 
in Vol. I, Sect. 8.1, and will be discusses anew in Sect. 6.6.1). 

A plane curve R? can be defined by a continuous map 7p : J C R > [0, +00) xR, 
where p(t) = (r(t), 0(t)) are the polar coordinates of the point image of the value 
t of the parameter. It corresponds to the curve y(t) = r(t) cos 0(t) i+ r(t) sin O(t) 7 
in Cartesian coordinates. The spiral of Example 4.34 v) can be parametrised, in 
polar coordinates, by p(t) = (t,t), t € [0, +00). 

Similarly, we can represent a curve in space using cylindrical coordinates, 
y(t) = (r(t), 0(t), z(t)), with y. : I C R > [0,+00) x R? continuous, or using 
spherical coordinates, s(t) = (r(t), p(t), A(t), with ys. : I C R > [0,+00) x R? 
continuous. The circular helix of Example 4.34 vi) is y(t) = (1,t,t), t € R, 
while a meridian arc of the unit sphere, joining the North and the South poles, is 
s(t) = (1,t, 40), for t € [0,7] and with given 0 € [0, 27]. 


4.7 Surfaces in R® 


Surfaces are continuous functions defined on special subsets of R?, namely plane 
regions (see Definition 4.15); these regions play the role intervals had for curves. 


Definition 4.36 Let R C R? be a region. A continuous map o : R — R? is 


said surface, and © = o(R) CR? is the trace of the surface. 
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The independent variables in R are usually denoted (u,v), and 


is the Cartesian representation of o. 
A surface is simple if the restriction of o to the interior of R is one-to-one. 
A surface is compact if the region R is compact. It is the 2-dimensional 
analogue of an arc. 


Examples 4.37 


i) Consider vectors a, b € R® such that a A b 4 0, and c € R®. The surface 
a: R? > R 

o(u,v) =aut+bvt+ec 
(ayut biv + c1)é + (agu t+ bav + c2)Z + (agu + b3u + c3)k 


parametrises the plane JJ passing through the point c and parallel to a and b 
(Fig. 4.15, left). 


The Cartesian equation is found by setting x = (x,y, z) = o(u, v) and observing 


xz—-c=au+ bv, 

i.e., 2 — cis a linear combination of a and b. Hence by (4.7) 
(aA b)-(a—c)=(aAb)-au+(aAb)-bv=0, 
SO 
(a\b)-x%=(aAb)-c. 

The plane has thus equation 

axr+ By+yz=6, 
where a,3 and ¥ are the components of a A b, and 6 = (aA b)-c. 


& 


Figure 4.15. Representation of the plane J/ (left) and the hemisphere (right) relative 
to Examples 4.37 i) and ii) 
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ii) Any continuous scalar function y : R — R defined on a plane region produces 
a surface a : R > R® 


whose trace is the graph of y. Such a surface is sometimes referred to as topo- 
graphic (with respect to the z-axis). 
The surface 0: R = {(u,v) € R?: u? +0? <1} 5 R, 


o(u,v) =uitvjt+Vl—u?-vk 
produces as trace the upper hemisphere centred at the origin and with unit 
radius (Fig. 4.15, right). 
Permuting the components u, v, y(u,v) of o clearly gives rise to surfaces whose 
equation is solved for x or y instead of z. 


iii) The plane curve y : J C R > R? given by y(t) = (71(£), 0, y(t), v(t) > 0 
for any t € I has trace I’ in the half-plane xz where x > 0. 

Rotating I’ around the z-axis gives the trace Y of the surface o : I x [0,27] + R® 
defined by 


ao(u,v) = y1(u) cosvi t+ y1(u) sinvj + y¥3(u)k. 


One calls surface of revolution (around the z-axis) a surface thus obtained. 
The generating arc is said meridian (arc). For example, revolving around the 
z-axis the parabolic arc  parametrised by -y : [—1,1] > R°, y(t) = (4— #7, 0,2) 
yields the lateral surface of a plinth, see Fig. 4.16, left. 
Akin surfaces can be defined by revolution around the other coordinate axes. 
iv) The vector-valued map 

a(u,v) =(%o +rsinucosv)i + (yo +rsinusinv)j + (zo + rcosu)k 
on R = [0,7] x [0,27] is a compact surface with trace the spherical surface 
of centre % = (Xo, Yo, Zo), radius r > 0. Slightly more generally, consider the 
(surface of the) ellipsoid, centred at ao with semi-axes a,b,c > 0, defined by 


(w@=20)* , (y=wo)* , (z= 20)" _,. 


b? ce 
This surface is parametrised by 
a(u,v) = (a + asinucosv)t + (yo + bsinusinv)j + (zo + ccosu)k. 
v) The function 0 : R > R°, 
o(u,v) = ucosvti+usinvuj + vk 


with R = [0,1] x [0,47], defines a compact surface; its trace is the ideal parking- 
lot ramp, see Fig. 4.16, right. The name helicoid is commonly used for this 
surface defined on R = [0,1] x R. 


All surfaces considered so far are simple. 
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» 
~ 


oi 


Figure 4.16. Plinth (left) and helicoid (right) relative to Examples 4.37 iii) and v) 


We will see in Sect. 7.2 sufficient conditions for an implicit equation 


f(x,y,z) =e 


to be solved in one of the variables, i.e., to determine the trace of a surface which 
is locally a graph. 


For surfaces, too, there is an alternative representation making use of cylindrical 
or spherical coordinates. The lateral surface of an infinite cylinder with axis z 
and radius 1, for instance, has cylindrical parametrisation o. : [0,27] x R > 
R?, o-(u,v) = (r(u,v), O(u, v), 2(u,v)) = (1,u,v). Similarly, the unit sphere at 
the origin has a spherical parametrisation o, : [0,7] x [0,27] > R°, o,(u,v) = 
(r(u,v), p(u,v),A(u,v)) = (1,u,v). The surfaces of revolution of Example 4.37 
iii) are a, : I x [0,27] > R3, with o-(u,v) = (v1(u),v,73(u)) in cylindrical 
coordinates. 


4.8 Exercises 


1. Determine the interior, the closure and the boundary of the following sets. Say 
if the sets are open, closed, connected, convex or bounded: 


A={(a,y) €R?:0<2<1,0<y<1} 
B = (B2(0) \ ([-1, 1] x {0})) U ((-1, 1) x {3}) 
C = {(x,y) €R?: lyl > 2} 


2. Determine the domain of the functions: 


fa)| Flay )- 
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f(x,y) = V1—3ay 
fay) = Vet yt - 


Y-a@ 


f(@,y, 2) = log(x* + y* + 2° — 9) 


e) f(x,y) = (<a: es] 


] 
f(x,y, z) = — Y, “s*) 
eo 
g) y(t) =(@, vé—-1, V5-2) 
y(t) = (= log(9—t ) 
1 
: - 2 2 
o(u,v) = (log(1 —u*—v*), u, 2 ry 
— 2 
| o(u.r) = ( cS ert ary ) 
3. Say whether the ae limits exist, and compute them: 
: LY 
b lin ——— 
DP x,y) 0, 0) /a? + y? (a,y)+(0,0) 2? + y? 
x 3a7y 
lim —log(1l+-2 d lim ——— 
[o)| (x,y) (0,0) Y Bl ) ) (x,y)>(0,0) 27 + y? 
xy cy + yz? + 22? 
e) lm = ——> f) im SEE; SEC SY 
(«,y)+(0,0) 24 + y? (x,y,z)+(0,0,0) 2? +y? + 24 
, a2 ‘ : g2 — ety? + y3 
1m = 1m ——_ 4. a 
=L) era Oa pee y2 (x,y) (0,0) Gye 
: : TY : 2 2 2 2 
i lim L lim e+ log(a* + +5 
(x,y)+(0,0) & + Y ) (x,y =O y') lost ¥) 
2 1 \/1 + 3x? + By? 
ie i eee 
(a,y)400 a +y (x,y)—00 r+y 
4. Determine the set on which the following maps are continuous: 
f(x,y) = arcsin(xy — x — 2y) b) J@y2)= ae 


As the real number a > 0 varies, study the continuity of: 
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The cylinders z = x? and z = 4y? intersect along the traces of two curves. 
Find a parametrisation of the one containing the point (2, —1, 4). 


Parametrise (with a curve) the intersection of the plane x + y+ z = 1 and the 


cylinder z = x”. 


8. Parametrise (with a curve) the intersection of x? + y2 =16 andz=x+y. 


9. Find a Cartesian parametrisation for the curve y(t) = (r(t), 0(t)), t € [0, 27], 
given in polar coordinates, when: 


t 
ft 2 = mY 
y(t) = (sin? t, t) 4(t) = (sin 5,4) 
10. Eliminate the parameters u, v to obtain a Cartesian equation in x, y, z rep- 
resenting the trace of: 


a(u,v) = aucosvi + businvj + u7k, a,beR 
elliptic paraboloid 


b) o(u,v) = ut+ asinvj + acosvk, aéR 
cylinder 


a(u,v) = (a+ bcosu) sinvi + (a+ bcosu)cosvj +bsinuk, 0<b<a 
torus 


4.8.1 Solutions 
1. Properties of sets: 
a) The interior of A is 
A={(a,y) €R?:0<2<1,0<y<1}, 
which is the open square (0,1). Its closure is 


A={(2,y)€R?:0<2<1,0<y<1}, 


i.e., the closed square [0, 1]?. The boundary is 


0A = {(z,y) CR? : 2=O0oreg=1,0<y<1}U 
U{(z,y) ER? :y=0ory=1,0<2< 1}, 
which is the union of the four sides (perimeter) of [0, 1]?. 


Then A is neither open nor closed, but connected, convex and bounded, see 
Fig. 4.17, left. 
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y, 2 y, 
' A B 
2 C 
{ « cme if: 
=) 


Figure 4.17. The sets A, B, C of Exercise 1 


b) The interior of B is 


B = B,(0) \ ([-1,1] x {0}), 
the closure _ 
B = B2(0) U ({-1, 1] x {3}), 


the boundary 
OB = {(a,y) € R?: 27 4+ y? = 2} U ((-1, 1] x {0}) U ((-1, 1] x {3}). 


Hence B is not open, closed, connected, nor convex; but it is bounded, see 
Fig. 4.17, middle. 
c) The interior of C is C itself; its closure is 


C = {(x,y) € R?: [yl > 2}, 
while the boundary is 
OC Hay eR iy=221, 
making C open, not connected, nor convex, nor bounded (Fig. 4.17, right). 
2. Domain of functions: 


a) The domain is {(x, y) € R? : « # y?}, the set of all points in the plane except 
those on the parabola x = y?. 


b) The map is defined where the radicand is > 0, so the domain is 
1 1 
{(z,y) €R’ iy < = ife>0,y> 3, Pe <0 yeRifc=O}, 


which describes the points lying between the two branches of the hyperbola 
1 


Y = 35° 
c) The map is defined for 3x + y+1> 0 and 2y— 2 > 0, implying the domain is 
{(0,y) €R? : y > 3a 1} {(2,y) € RP sy > 5}. 


See Fig. 4.18. 
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y=—3x—1 


Ka 
| ¥ 
wie | 


Figure 4.18. The domain of the function f of Exercise 2. c) 


d) The function’s domain is defined by the positivity of the log’s argument, 


whence 


{(2,y,z) © RB: 2727 +y? +2? >9}. 


These are the points of the plane outside the sphere at the origin of radius 3. 
dom f = R? \ {0}. 
The map is defined for z > 0 and x ¥ 0; there are no constraints on y. The 


domain is thus the subset of R® given by the half-space z > 0 without the 
half-plane x = 0. 


dom = = [L, 5). 


t—2 
The components x(t) = rea and y(t) = log(9 — t”) of the curve are well 


defined for t #4 —2 and t € (—3,3) respectively. Therefore y is defined on the 
intervals J; = (—3, —2) and Ig = (—2,3). 

The surface is defined for 0 < u? + v? < 1, ie., for points of the punctured 
open unit disc on the plane wv. 

The domain of o is domo = {(u,v) € R?: u+vu > 0,u2 +0? F 4}. See 
Fig. 4.19 for a picture. 


> 


\ 


2 


~ 


Figure 4.19. The domain of the surface o of Exercise 2. £) 


150 4 Functions between Euclidean spaces 


3. Limits: 
a) From 
|x| = Va? < Va? + y? 
follows 
lee, 
\/x2 + y? 
Hence 


and by the Squeeze Rule we conclude the limit is 0. 
b) fy = 0 ore =0 the map f(t,y) = 55 


puting limits along the axes, then, the result is 0: 


is identically zero. When com- 


lim f(z,0) = lim f(0,y) =0. 
x—0 yO 


But along the line y = x the function equals 4 


5» SO 


li F (oe) = li a = = 
z—+0 z>0 0° +2 2, 
In conclusion, the limit does not exist. 
c) Let us compute the function f(x,y) = = log(1+<) along the lines y = kx with 
k £0; from 4 


f(a, ke) = Flog(1 +2) 


follows 
lim f(a, ke) =0. 
xz—-0 


Along the parabola y = x* though, f(x,x*) = = log(1+ 2), ice., 


line F(gye7) = 1: 


x—0 


The limit does not exist. 


a) 0, e) Does not exist. f) Does not exist. 
g) Since 
ae, 
cos* 6 
| f(r cos 0,rsin 0@)| = a =rcos?@ af, 
r 


Proposition 4.28, with g(r) =r, implies the limit is 0. 
h) 1. 
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i) From 
cos 6 — sind 


f(rcos6,r sin 6) => ey ay) 
follows § —sin0 
lim, f(rcos6,rsin@) = saa 
rT 


cos6 + sin 8’ 
thus the limit does not exist. 
£) 5. 
gt+ytl 
Set = Serthat 
m) Set f(x,y) Bay so tha 
a yt 
f(x,0) =— 3 and = f(0,y) = 
r Y 
Hence 
lim f(2,0)=1 and lim f(0,y) =0 
L—CO Yoo 


and we conclude the limit does not exist. 
n) Note 


2 2 Fae 
0< 4/1 + 3x? + by J 1+ 5(x? + y?) 


V 0,0). 
eg se te 40.0) 


Set t = 2? + y*, so that 


V1+5(x2 + y?) .  wVl+5t 
lim “———" = lim —— =0 
(a,y)—00 xe? + y? t+00 t 


and 


~~ (x%,y)00 v2 + y? ~ t-¥-+00 


The required limit is 0. 
4. Continuity sets: 


a) The function is continuous on its domain as composite map of continuous 
functions. For the domain, recall that arcsin is defined when the argument lies 
between —1 and 1, so 


dom f = {(x,y) € R?: -1 < 2y—2-2y <1}. 


Let us draw such set. The points of the line x = 2 do not belong to dom f. If 
x > 2, the condition —1+ a < (x —2)y < 1+ 2 is equivalent to 


1 xr—-1 r+1 3 
7s ; 
r-—2 xr—2 xr—2 r—2 
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this means dom f contains all points lying between the graphs of the two 


3 
hyperbolas y = 1+ 5 andy =1+ 7 oO Similarly, if « < 2, the domain is 
a 


rt 


1+ Ss <y<1+ : 
ln c—2Q- 


Overall, dom f is as in Fig. 4.20. 
b) The continuity set is C = {(z,y,z) € R8: 2427+ y7}. 


5. Let uscompute lim _ f(x,y). Using polar coordinates, 
(x,y)—>(0,0) 


f(rcos6,rsin 8) = r°~?| cos6|® sin(r sin 8) ; 
but | sint| < |t], valid for any t € R, implies 
|f(r cos 0, rsin @)| < r°—"| cos |*| sin O| , 


so the limit is zero if ~ > 1, and does not exist if a < 1. Therefore if a > 1 the 
map is continuous on R?, if 0 < a < 1 it is continuos only on R? \ {O}. 


2= 97 
z= Ay? 
gives x? = 4y?. The cylinders’ intersection projects onto the plane xy as the two 


lines = +2y (Fig. 4.21). As we are looking for the branch containing (2, —1, 4), 
we choose « = —2y. One possible parametrisation is given by t = y, hence 


6. The system 


A) == (tar); teR. 


Figure 4.20. The continuity set of f(x,y) = arcsin(xy — x — 2y) 
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Figure 4.21. The intersection of the cylinders z = x” and z = 4y? 


7. We have x + y +2? = 1, and choosing t = x as parameter, 


y(t) = (¢,1-t-?#,#), tER. 


8. Use cylindrical coordinates with x = 4cost, y = 4sint. Then 


y(t) = (4cost,4sint, 4(cost+ sint)) , t € [0, 27]. 


9. Curves in Cartesian coordinates: 


a) We have r(t) = sin? t and 6(t) = t; recalling that 2 = rcos@ and y = rsin60, 
gives 
x = sin’ tcost and y =sin?t. 
In Cartesian coordinates then, y : [0,27] — R? can be written y(t) = 
(x(t), y(t)) = (sin? tcost, sin® t), see Fig. 4.22, left. 


b) We have y(t) = (sin £cost,sin$sint). The curve is called cardioid, see 
Fig. 4.22, right. 


10. Graphs: 


a) It is straightforward to see 
2 2 
x 
= —_ u? cos? v + u? sin? v = u? = z, 
a b? 
giving the equation of an elliptic paraboloid. 
b) y?+27=a?. 
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Figure 4.22. Traces of the curves of Exercise 9. a) (left) and 9. b) (right) 


c) Note that 27+ y? = (a+ bcosu)? and a+bcosu > 0; therefore \/x2 + y2-a = 
bcosu. Moreover, 


(\/x? + y? — a)? + 2? = bcos? u + b? sin? u = B?; 


in conclusion 


(Ja? +y2 -—a)° +227 =07. 


5 


Differential calculus for scalar functions 


While the previous chapter dealt with the continuity of multivariable functions 
and their limits, the next three are dedicated to differential calculus. We start in 
this chapter by discussing scalar functions. 

The power of differential tools should be familiar to students from the first Cal- 
culus course; knowing the derivatives of a function of one variable allows to capture 
the function’s global behaviour, on intervals in the domain, as well as the local 
one, on arbitrarily small neighbourhoods. In passing to higher dimension, there 
are more instruments available that adapt to a more varied range of possibilities. 
If on one hand certain facts attract less interest (drawing graphs for instance, or 
understanding monotonicity, which is tightly related to the ordering of the reals, 
not present any more), on the other new aspects come into play (from Linear 
Algebra in particular) and become central. The first derivative in one variable is 
replaced by the gradient vector field, and the Hessian matrix takes the place of 
the second derivative. Due to the presence of more variables, some notions require 
special attention (differentiability at a point is a more delicate issue now), whereas 
others (convexity, Taylor expansions) translate directly from one to several vari- 
ables. The study of so-called unconstrained extrema of a function of several real 
variables carries over effortlessly, thus generalising the known Fermat and Weier- 
strass’ Theorems, and bringing to the fore a new kind of stationary points, like 
saddle points, at the same time. 


5.1 First partial derivatives and gradient 


The simplest case where partial derivatives at a point can be seen is on the plane, 
i.e., in dimension two, which we start from. 


Let f : dom f C R? + R be a function of two variables defined in a neighbour- 
hood of the point xo = (0, yo). The map z +> f(x, yo), obtained by fixing the 
second variable to a constant, is a real-valued map of one real variable, defined 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
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around the point x9 € R. If this is differentiable at xo, one says f admits partial 
derivative with respect to x at Zo and sets 


me f(z, yo) — (£0,490) 


wL— XO 


Similarly, if y+ f(xo, y) is differentiable at yo, f is said to admit partial deriv- 
ative with respect to y at xo and one defines 


The geometric meaning of partial derivatives is explained in Fig. 5.1. 


This can be generalised to functions of n variables, n > 3, in the most obvious 
way. Precisely, let a map in n variables f : dom f C R” — R be defined on the 
n 


neighbourhood of x = (%01,---,Zon) = >) oie, where e; is the 7th unit vector 


i=1 
of the canonical basis of R” seen in (4.1). One says f admits partial derivative 
with respect to x; at Xo if the function of one real variable 


XL t> f (x01, +++) U0 %i-1,%, TOi4+1,---; Lon) : 
= ps 
| Po 
i 
= i) 
] 
I 
ie 7] 
ee 
xv 


Gr ee 


Figure 5.1. The partial derivatives of f at xo are the slopes of the lines r1, r2, tangent 
to the graph of f at Po 


5.1 First partial derivatives and gradient 157 


obtained by making all independent variables but the 7th one constant, is differ- 
entiable at x = xo;. If so, one defines the symbol 


= Gat (ton 8 -,X0,i-1, XL, U0 i+1; Tee) Gon) 


- f(xo + Av ei) — f(xo) 


The partial derivative of f at ao with respect to the variable x; is also denoted 
as follows 


Def (0) ’ Dif (xo) ’ Tas (x0) 


(or simply f;(ao), if no confusion arises). 


Definition 5.1 Assume f admits partial derivatives at x9 with respect to all 
variables. The gradient Vf (ao) of f at xo is the vector defined by 


Zi = FF (as)er+...+ 24 (eo) en ER”. 


VF (wo) = (55 (@o) 


2; _— Ox 


Another notation for it is grad f(a). 


We remind that, as any vector in R", Vf(ao) can be written both as row or 
column vector, according to need. 
Examples 5.2 


i) Let f(x,y) = \/x? + y? be the function ‘distance from the origin’. At the point 
Lo = (2, =i), 


Of d 5 x 2 
— = — 1 _— — 
Ox (xo) ag Vue + _ =) TY - Je ) 
Of d -—-,> y 1 
Oy dy be J4t y? —_ Jd 
Therefore 
Vi (20) = (42) = 2-1) 
0) — vs V5 — /3 ’ : 


ii) For f(x,y, z) = ylog(2x — 3z), at ao = (2,3, 1) we have 


af 2 
ae = — 3log(2x — = 3 = 6 
Ox (20) an” Belen =) o=2 22 — 3|,.—9 
af a 

OL Geves —-alog ll, =O 

of (ay) = Lylogi|,_, =0, 
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d 
O (20) dz og ) z= 4 — 3z 


whence 
V f (Zo) = (6, 0, =H) 


iii) Let f : R° > R be the affine map f(a) =a-x2+b,a€R", DER. At any 
point ap € R”, 


Vf(xo) =a. 
iv) Take a function f not depending on the variable x; on a neighbourhood of 
O 
xo € dom f. Then of (ao) = 0. 


In particular, f constant around ao implies V f(a) = 0. 


The function 


defined on a suitable subset dom of C dom f C R” and with values in R, is called 


i 
(first) partial derivative of f with respect to x;. The gradient function of f, 


Vf: 2HVf (x), 


whose domain dom Vf is the intersection of the domains of the single first partial 
derivatives, is an example of a vector field, being a function defined on a subset of 
IR” with values in R”. 


O 
In practice, each partial derivative OF is computed by freezing all variables of 


Oty 


f different from x; (taking them as sonst ants), and differentiating in the only one 
left 2;. We can then use on this function everything we know from Calculus 1. 


Examples 5.3 


We shall use the previous examples. 


i) For f(x,y) = /z? + y? we have 
x 


V f(x) = (ee is) ee 
Jaap Vere) Tell 
with dom Vf = R? \ {0}. The formula holds in any dimension n if f(a) = |ja|| 
is the norm function on R”. 
ii) For f(x,y, z) = ylog(2x — 3z) we obtain 
Vie) = (5 lower =e) Zu =) 
with dom Vf = dom f = {(z, y,z) € R® : 2x — 3z > O}. 
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iii) The total resistance R produced by three conductors with resistances 
R,, Ro, Rz in a parallel circuit is given by the formula: 

| a 1 ik 

RR R, + Ro + Rs , 
We want to compute the partial derivative of R with respect to one of the R,, 
say R,. As 


1 R,R2R3 
R(R, Ro, R3) = ———_ = ———_—_—_——_ 
(Fi, Ra, Rs) Bt+e+ fh Rohs t+ RRs + RR 
we have 

OR Ree 

——(R,, Ro, R3) = a —*=Ee 

OR, (1 Ra, Ra) (RoR3 + Ri R3 + Ri Ro)? 

Partial derivatives with respect to the x7;, 7 = 1,...,n, are special cases of 


the directional derivative along a vector. Let f be a map defined around a point 
zo € R” and let v € R” be a given non-zero vector. Then f admits partial 
derivative, or directional derivative, along v at Xo if 


exists and is finite. 


Another notation for this is D,f (ao). The condition spells out the fact that 
the map t +> f(ao + tv) is differentiable at to = 0 (the latter is defined in a 
neighbourhood of to = 0, since x» +tv belongs to the neighbourhood of a9 where f 
is defined, for t small enough). See Fig. 5.2 to interpret geometrically the directional 
derivative. 


Figure 5.2. The directional derivative is the slope of the line r tangent to the graph of 
f at Po 
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The directional derivative of f at ao with respect to x; is obtained by choosing 
v = e;; thus 


as is immediate by comparing (5.2) with (5.1), using Az = t. 
In the next section we discuss the relationship between the gradient and direc- 
tional derivatives of differentiable functions. 


5.2 Differentiability and differentials 


Recall from Vol. I, Sect. 6.6 that a real map of one real variable f, differentiable 
at xo € R, satisfies the first formula of the finite increment 


f(x) = f(ao) + f'(x0)(a@ — xo) + o(@ — x0), LX. (5.3) 


This is actually equivalent to differentiability at xo, because if there is a number 
a € R such that 


f(x) = f(ao) +a(x — 29) + 0(@ — Xo), L—-> Xo, 


necessarily f is differentiable at xo, and a = f’(xq). From the geometric viewpoint, 
furthermore, differentiability at 79 amounts to the existence of the tangent line to 
the graph of f at the point Po = (ao, f(xo)): 


y = t(x) = f(xo) + f'(ao) (x — x0). 


In presence of several variables, the picture is more involved and the existence of 
the gradient of f at a does not guarantee the validity of a formula like (5.3), e.g., 


f(x) = f(@o) + VF(@o) -(a@— ao) + o(lja—aoll), we ao, (5.4) 


nor the existence of the tangent plane (or hyperplane, if n > 2) to the graph of f 
at Po = (xo, f(ao)) € R"*!. Consider for example the function 


2 


f(a.y) = Pap if (x,y) # (0,0), 


0 if (x,y) = (0,0). 


The map is identically zero on the coordinate axes (f(z,0) = f(0,y) = 0 for any 
x,y € R), so 


of _ of 7 Zz 
Fey (010) = 50,0) = 0, ie,  Vf(0,0) = (0,0). 
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If we had (5.4), at ao = (0,0) we would find 


xy = 


lim § —————— = 0; 
(x,y)—+(0,0) (a? + y?)\/a? + y? 
but moving along the line y = x, for instance, we have 


r x 1 40 
0 0 0 — er 
xz—+0+ 2/22? |x| 2/2 


Not even the existence of every partial derivative at 29 warrants (5.4) will hold. It 
is easy to see that the above f has directional derivatives along any given vector v. 
Thus it makes sense to introduce the following definition. 


Definition 5.4 A function f is differentiable at an interior point xo of the 
domain, if V f(ao) exists and the following formula holds: 


f(x) = f(@o) + Vf(@o)-(@— #0) + o(lla—aoll), ea. (5.5) 


In the case n = 2, 
z= f(@o) + Vf(#o) - (@— 20), (5.6) 


Le., 


z= f(£o, yo) + F(a, yo)(# — #0) + Fn (eo, Yyo)(Y¥ — Yo) 5 (5.7) 


defines a plane, called the tangent plane to the graph of the function f at Po = 
(x0, Yo, } (20, yo))- This is the plane that best approximates the graph of f on a 
neighbourhood of Po, see Fig. 5.3. The differentiability at ao is equivalent to the 
existence of the tangent plane at Po. In higher dimensions n > 2, equation (5.6) 
defines the hyperplane (affine subspace of codimension 1, i.e., of dimension n — 1) 
tangent to the graph of f at the point Po = (xo, f(ao)). 

Equation (5.5) suggests a natural way to approximate the map f around ao 
by means of a polynomial of degree one in x. Neglecting the higher-order terms, 
we have in fact 


f(x) ~ f(xo) + VF (ao) - (a — wo) 


on a neighbourhood of ao. This approximation, called linearisation of f at xo, 
often allows to simplify in a constructive and efficient way the mathematical de- 
scription of a physical phenomenon. 


Here is yet another interpretation of (5.5): put Aw = x — Zo in the formula, 
so that 


f(ao + Ax) = f(@o) + Vf (x0) - Ax + o(|| Aa) , Az > 0, 


162 5 Differential calculus for scalar functions 


z 


Figure 5.3. Tangent plane at Po 


in other words 
Af = f(ao + Ax) — f(a) = Vf (ao) - Ax + o(|| Axl) , Az > 0. 


Then the increment Af of the dependent variable is, up to an infinitesimal of order 
bigger than one, proportional to the increment Aa of the independent variable. 
This means the linear map Ax +> V f(a) - Ax is an approximation, often suffi- 
ciently accurate, of the variation of f in a neighbourhood of ag. This fact justifies 
the next definition. 


Definition 5.5 The linear map dfz, : RR” — R 


Ope 4 eal — Gan lee san 


is called differential of f at xo. 


Example 5.6 


Consider f(x,y) = /I+2+y and set wp = (1,2). Then Vf (ao) = (F,4) and 
the differential at ao is the function 


1 1 
djan( Ag Ay) = qa + q4y- 


Choosing for instance (Ar, Ay) = (+45, 4), we will have 
203 1 1 
Af =4,/— —2=0.014944... hil dfe,(— =, —) = 0.015. 
I= 50 ne while dfeo (799 aq) 


Just like in dimension one, differentiability implies continuity also for several 
variables. 
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Proof. From (5.4), 


lin Jia) = lim (f(ao) + Vf (xo) - (@ — @o) + o(||a — xoll)) = f(xo). 


L>LO L—-Lo 


This property shows that the definition of a differentiable function of several 
variables is the correct analogue of what we have in dimension one. 

In order to check whether a map is differentiable at a point of its domain, 
the following sufficient condition if usually employed. Its proof may be found in 
Appendix A.1.1, p. 511. 


Proposition 5.8 Assume f admits continuous partial derivatives on a 


neighbourhood of xo. Then f is differentiable at xo. 


Here is another feature of differentiable maps. 


Proposition 5.9 Jf a function f is differentiable at ao, it admits at ao dir- 
ectional derivatives along any vector v £0, and moreover 


0 
veep ot (20) Un - (5.8) 


Proof. Using (5.4), 


f(xo + tv) = f(xo) + tVf (xo) - v + o(|ltvll) , ||¢e|| + 0. 
Since ||tv|| = |é|||v|], we have o(||tw||) = o(t), t > 0, and hence 
Of _ 1. f(vo + tv) — f (xo) 
aa eae: t 
= tim _ v+o(t) _ Ce 


Note that (5.8) furnishes the expressions 


Of 
Ox; 


(ao) = e;- Vf (#0), i=1,...,n, 


which might turn out to be useful. 

Formula (5.8) establishes a simple-yet-crucial result concerning the behaviour 
of a map around 2p in case the gradient is non-zero at that point. How does 
the directional derivative of f at xo vary, when we change the direction along 
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0 
which we differentiate? To answer this we first of all establish bounds for oF (ao) 


when v is a unit vector (||v|| = 1) in R”. By (5.8), recalling the Cauchy-Schwarz 
inequality (4.3), we have 


O 

FF (a9)] < IV F(@o)l 
1.e., 

Of 
— IVF (ao) Il SZ (@o) SIV Feo) Il - 
bass V f (ao) Of ; ; 
Now, equality is attained for v = +————, and then (a) reaches its (posit- 
_— Mico] a 7") 


ive) maximum or (negative) minimum according to whether v is plus or minus the 
unit vector parallel to V f(a). In summary we have proved the following result, 
shown in Fig. 5.4 (see Sect. 7.2.1 for more details). 


Proposition 5.10 At points ao where the gradient of f does not vanish, f 


has the greatest rate of increase, starting from 2x0, in the direction of the 
gradient, and the greatest decrease in the opposite direction. 


The next property will be useful in the sequel. The proof of Proposition 5.9 
showed that the map y(t) = f(ao + tv) is differentiable at t = 0, and 


yp (0) = Vf (ao) -v. (5.9) 


Figure 5.4. Level curves and direction of steepest slope 
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More generally, 


Property 5.11 Leta, v € R” be given points and suppose f is differentiable 
at a+tov, to € R. Then the map y(t) = f(a+tv) is differentiable at to, and 


y'(to) = Vf (a+ tov) -v. (5.10) 


Proof. Set At = t— io, so that a+ itv = (a + tov) + Atv and then y(t) = 
f(a+tov + Atv) = (At). Now apply (5.9) to the map w?. 


Formula (5.10) is nothing else but a special case of the rule for differentiating 
a composite map of several variables, for which see Sect. 6.4. 


5.2.1 Mean Value Theorem and Lipschitz functions 
Let a, b be distinct points in R” and 
Sia, b] = {a(t) = (1-tha+tb:0<t<1} 


be the segment between a and 5, as in (4.19). The result below is the n-dimensional 
version of the famous result for one-variable functions due to Lagrange. 


Theorem 5.12 (Mean Value Theorem or Lagrange Theorem) Let f : 
dom f C R” > R be defined and continuous at any point of S|a, bj, and 
differentiable at any point of S|a,b] with the (possible) exception of the end- 
points a and b. Then there exists an & € S|a,b| different from a, b such 
that 


f(b) — f(a) = VF): (b— a). (5.11) 


Proof. Consider the auxiliary map y(t) = f(w(t)), defined - and continuous - on 
[0,1] C R, as composite of the continuous maps t + a(t) and f, for any 
x(t). Because z(t) = a+t(b—a), and using Property 5.11 with v = b—a, 
we obtain that y is differentiable (at least) on (0,1), plus 


y(t) =Vif(x(t))-(b-a@), O<t<1. 


Then y satisfies the one-dimensional Mean Value Theorem on [0, 1], and 
there must be a f € (0,1) such that 


Putting © = a(t) in the above gives precisely (5.11). 
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As an application, we will find a result known to the reader at least in dimension 
one. 


Proposition 5.13 Let R be a region in R” and f :R CR” > R a continu- 
o) 


ous map on R, differentiable everywhere on A= R. Then 


Vi =O) on A ——s f is constant on R. 


Proof. We have seen that f constant on R, hence on A a fortiori, implies Vf = 0 
at each point of A (Example 5.2 iv)). 
Conversely, let us fix an arbitrary a in A, and choose any b in R. There 
is a polygonal path Plao,...,a,| with ao = a, a, = b, going from one 
to the other (directly from the definition of open connected set if b € A, 
while if b € OA we can join it to a point of A through a segment). On 
each segment S|a;_1,a,;|, 1 <j <r forming the path, the hypotheses of 
Theorem 5.12 hold, so from (5.11) and the fact that Vf = 0 identically, 
follows 


f(aj) — f(aj-1) =0, 


whence f(b) = f(a) for any b € R. That means f is constant. Oo 


Consequently, if f is differentiable everywhere on an open set A and its gradient 
is zero on A, then f is constant on each connected component of A. 


The property we are about to introduce is relevant for the applications. For 
this, let 7 be a region of dom f. 


Definition 5.14 The map f is Lipschitz on R if there is a constant L > 0 
such that 


lf ei f es) = Ella — w5l Var, eo © e. (5.12) 


The smallest number L verifying the condition is said Lipschitz constant 
of f on R. 


By definition of least upper bound we easily see that the Lipschitz constant of 


pete |f(w1) — f(w2)| 
sup —————_—_—_. 
@1,229ER ||e1 —_ z|| 
21 Ax2Q 


To say that f is Lipschitz on R tantamounts to the assertion that the supremum 
is finite. 


Examples 5.15 


i) The affine map f(x) = a-x2+b (a € R”, bE R) is Lipschitz on R”, as shown 
by (4.21). It can be proved the Lipschitz constant equals |lall. 
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ii) The function f(a) = ||x||, mapping x € R” to its Euclidean norm, is Lipschitz 
on R” with Lipschitz constant 1, as 
[ileal] — l]xall| < ler - al], Vai, a2 © R”. 

This is a consequence of #1 = %2 + (a1 — 2) and the triangle inequality (4.4) 

Ileal] < [|e] + ler — wel], 
L€., 

Ileal] — ||e2l| < |e — wel]; 


swapping x; and 22 yields the result. 


iii) The map f(a) = ||a||? is not Lipschitz on all R”. In fact, choosing a2 = 0, 
equation (5.12) becomes 
lleal|? < Lie], 


which is true if and only if ||a1|| < LZ. It becomes Lipschitz on any bounded region 
R, for 


[Naall? — le2l]?] = (laeal] + leit) [laa] — lel] 
<2M|||x1||—||woll|, Var,a2 ER, 


where M = sup{||x|| : a € R} and using the previous example. 


Note if R is open, the property of being Lipschitz implies the (uniform) con- 
tinuity on R. 
There is a sufficient condition for being Lipschitz, namely: 


Proposition 5.16 Let f be differentiable over a convex region R inside 
dom f, with bounded (first) partial derivatives. Then f is Lipschitz on R. 
Precisely, for any M > 0 such that 


of 


Fa) <M, VER, t= liens tte 


one has (5.12) with L=/nM. 


Proof. Pick 21, x2 € R. By assumption the segment S[a1, x2] is contained in R 
and f is differentiable (hence, continuous) at any point of S|[a1, #2]. Thus 
by the Mean Value Theorem 5.12 there is an % € R with 


f(@1) — f(@2) = VF) - (a1 — £2). 
The Cauchy-Schwarz inequality (4.3) tells us 
|f(w1) — f(@2)| < ||VF(@)|| |]w1 — x]. 


To conclude, observe 
gx 1/2 “ 1/2 
z < (soa") = /nM. 
i=1 


nm 


IVE@|l = (>: 


i=1 
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The assertion that the existence and boundedness of the partial derivatives 
implies being Lipschitz is a fact that holds under more general assumptions: for 
instance if R is compact, or if the boundary of FR is sufficiently regular. 


5.3 Second partial derivatives and Hessian matrix 


admits 


Let f admit partial derivative in x; on a whole neighbourhood of xo. If 
vj 

first derivative at 29 with respect to 7;, we say f admits at x second partial 

derivative with respect to x; and x;, and set 


af Wee: 
OLGOn, (x0) = Ox; (34) (x0) : 


If i # 7 one speaks of mixed second partial derivatives, while for 7 = 7 second 
; —— ' om 

partial derivatives are said of pure type and denoted by the symbol ae Other 
Hi 


v 
ways to denote second partial derivatives are 


Dé.,,4 (®o) ; D3, f (xo) ) Fee s2; (xo) ’ filo) : 
In case 7 is different from j, and assuming f admits at ao mixed derivatives 
Os (ao) and iit 
n 
OL {O25 : OxL,0n; 


(ao), these might differ. Take for example 


if (x,y) # (0,0), 


0 if (4. y) = (0,0), 
Of Of 
f i h that —— = —1 while —— = 1. 
a function such that aon while Daag? 9) 


We have arrived at an important sufficient condition, very often fulfilled, for 
the mixed derivatives to coincide. 


2) 


Theorem 5.17 (Schwarz) If the mixed partial derivatives 7 and 
LjOXL; 


Oo? f 
CL.00, 
they coincide at xo. 


(j # i) exist on a neighbourhood of x9 and are continuous at xo, 


Proof. See Appendix A.1.1, p. 512. oO 
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Definition 5.18 Jf f possesses all second partial derivatives at xo, the matrix 


Hf (0) = (hij )r<ijen where 


om 
Ox? 
2 

O- f - 


Ax1025 (ao 


(xo) 


nee, 
O02r10%y y 


is called Hessian matrix of f at xo (Hessian for short). 


The Hessian matrix is also commonly denoted by Hess f(ao), or Hy (xo). 

If the mixed derivatives are continuous at xo, the matrix Hf(ao) is symmet- 
ric by Schwarz’s Theorem. Then the order of differentiation is irrelevant, and 
feie;(20), fig(@o) is the same as fz ,2,(@0), fji(X0). 

The Hessian makes its appearance when studying the local behaviour of f at 
Zo, aS explained in Sect. 5.6. 


Examples 5.19 
i) For the map f(x,y) = xsin(a + 2y) we have 


FE (a, y) = sin(z + 2y) + xcos(a# + 2y), Sa y) = 2x cos(x + 2y), 
so that 
Of 
a2) = 2cos(x + 2y) — xsin(x + 2y), 
2 2 
se) = Se) = 2cos(x + 2y) — 2x sin(a# + 2y), 
oF 


ye) = —Agsin(x + 2y). 


At the origin x) = O the Hessian of f is 


Hf(0) = ¢ ) . 


ii) Given the symmetric matrix A € R"*”, a vector b € R” and a constant 
c € R, we define the map f : R” > R by 


F(z) =a2-Ar+b-r+c=) 2, [Sen +S byt +c. 


p=1 q=1 p=1 
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For the product Az to make sense, we are forced to write x as a column vector, 
as will happen for all vectors considered in the example. By the chain rule 


Of = Ox “.. Ox 
3g, (2) = —— > Fat (Sots) + +m (Song | + oops. 
uv p=1 7 
For each pair re ‘les p, 1 between 1 a ae 


Obp 5s _ Ll: tf p=, 
Ox; » \0 ifpFi, 


sO 


nm nm 
= ) Qiglg + ) Lp Omi + Oy . 
q=1 = 


On the other hand A is symmetric and the summation is index arbitrary, so 


nm n n 
) Lpapi = ) Aiplpy = ) Qiqhg¢ - 
p=1 p=1 q=1 
Then 


OF ” 
Ba, ®) = 2. Gey + by, 


q=1 
Les 


Vif(x) =2An+b. 
Differentiating further, 


Af (x) =2A. 
Note the Hessian of f is independent of a. 


iii) The kinetic energy of a body with mass m moving at velocity v is K = Smv-. 


Then 
il 2 0 U 
VK(m,v) = (5? ,mv) and HkK(m,;v) = . 
Note that 


5.4 Higher-order partial derivatives 
off 
On;OR; 


O is 
atives of the functions ae under Schwarz’s Theorem, the order of differentiation 
Gs 


The second partial derivatives of f have been defined as first partial deriv- 


7 
is inessential. Similarly one defines the partial derivatives of order three as first de- 
2 


O 
Omen, 


rivatives of the maps (assuming this is possible, of course). In general, by 
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successive differentiation one defines kth partial derivatives (or partial deriv- 
atives of order k) of f, for any integer k > 1. Supposing that the mixed partial 
derivatives of all orders are continuous, and thus satisfy Schwarz’s Theorem, one 
indicates by 

OF f 
(xo) 


a a Qn 
CHI Cie ee OES 


the kth partial derivative of f at ao, obtained differentiating a; times with respect 


to £1, G2 times with respect to %2, ..., @, times with respect to x,,. The exponents 
qa; are integers between 0 and k such that aj +a2+...+a, =k. Frequent symbols 
include D“ f(ao), that involves the multi-index @ = (a1, Q2,...,Qn) € N”, but 


also, e.g., ferry to denote differentiation in x twice, in y once. 


One last piece of notation concerns functions f, called of class C* (k > 1) 
over an open set {2 C dom f if the partial derivatives of any order < k exist and 
are continuous everywhere on 2. The set of such maps is indicated by C*(Q). 
By extension, C°(2) will be the set of continuous maps on 2, and C%(Q) the 
set of maps belonging to C*(Q) for any k, hence the functions admitting partial 
derivatives of any order on §2. Notice the inclusions 


CPU 6 a8 O12) CO) Coe). 


Thanks to Propostion 5.8, a function in C'(Q) is differentiable everywhere on 2. 


Instead of the open set 2, we may take the closure Q, and assume (2 is con- 


tained in dom f. If so, we write f € C°(Q) if f is continuous at any point of 2; 
for 1 < k < oo, we write f € C*(Q), or say f is C* on 2, if there is an open set 
with Qc 2' Cdomf and f € C*(2'). 


5.5 Taylor expansions; convexity 


Taylor expansions allow to approximate, locally, a function using a polynomial in 
the independent variables by the knowledge of certain partial derivatives, just as in 
dimension one. We already encountered examples; for a differentiable map formula 
(5.5) holds, which is the Taylor expansion at first order with Peano’s remainder. 
Note 


T fi,eo(@) = f(#0) + VF (xo) - (@ — wo) 


is a polynomial of degree less or equal than 1 in the x;, called Taylor polynomial 
of f at xo of order 1. Besides, if f is C! on a neighbourhood of xo, then (5.11) 
holds with a = a and b= @ arbitrarily chosen in the neighbourhood, so 


(5.14) 
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Viewing the constant f(a) as a 0-degree polynomial, the formula gives the Taylor 
expansion of f at ao of order 0, with Lagrange’s remainder. 

Increasing the map’s regularity we can find Taylor formulas that are more and 
more precise. For C? maps we have the following results, whose proofs are given 
in Appendix A.1.2, p. 513 and p. 514. 


Theorem 5.20 A function f of class C? around xo admits at ao the Taylor 
expansion of order one with Lagrange’s remainder: 


f(a) = f(#o) + Vf (wo) - (@ — 0) + (2 — ao) Hf(®)(@— ao), (5.15) 


where & is interior to the segment S|x, xo]. 


Formulas (5.4) and (5.15) are two different ways of writing the remainder of f 
in the degree-one Taylor polynomial Tf1,2,(x). 
The expansion of order two is given by 


Theorem 5.21 A function f of class C? around x9 admits at a9 the follow- 
ing Taylor expansion of order two with Peano’s remainder: 


f(x) = f(xo) + Vf (xo) - (@ — #0) + 5 (2 — «o)- Hf (ao)(# — ao) (5.16) 


The expression 


is a polynomial of degree < 2 in the x;, called Taylor polynomial of f at 
zo of order 2. It gives the best quadratic approximation of the map on the 
neighbourhood of a (see Fig. 5.5 for an example). 

For clarity’s sake, let us render (5.16) explicit for an f(x,y) of two variables: 


f(z,y) = F (£0, yo) + fe (Xo, yo)(% — 20) + fy(Zo, yo) (y — yo) 
+5 fan, yo) (x _ xo)” — teal 20: Yo) (x -_ ro) (y _ Yo) ss + fay (0 Yo) (y = yo)? 
+0((a — £0)? + (y — yo)”), (x,y) — (Xo, Yo) - 


For the quadratic term we used the fact that the Hessian is symmetric by Schwarz’s 
Theorem. 

Taylor formulas of arbitrary order n can be written assuming f is C” around 
xo. These, though, go beyond the scope of the present text. 
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T f2,a0 


Figure 5.5. Graph of the order-two Taylor polynomial of f at ao (osculating paraboloid 
at Po) 


5.5.1 Convexity 


The idea of convexity, both global and local, for one-variable functions (Vol. I, 
Sect. 6.9) can be generalised to several variables. 

Global convexity goes through the convexity of the set of points above the 
graph, and namely: take f : C C R” > R with C a convex set, and define 


Ey = { (a, y)ER™! :2EeC,y> fae) ys 


Then f is convex on C if the set Ey is convex. 


Local convexity depends upon the mutual position of f’s graph and the tangent 
plane. Precisely, f differentiable at a) € dom f is said convex at Zo if there is a 
neighbourhood B,(ao) such that 


f(x) = f(@o) + Vfl@o)-(@— #0), Va € B,(x0); 


f is strictly convex at xo if the inequality is strict for any x # xo. 


It can be proved that the local convexity of a differentiable map f at any point 
in a convex subset C' C dom f is equivalent to the global convexity of f on C. 


Take a C? map f around a point zg in dom f. Using Theorem 5.21 and the 
properties of the symmetric matrix Hf (ao) (see Sect. 4.2), we can say that 


Hf (xo) is positive semi-definite <= ff is convex at xo 


Hf (x0) is positive definite => f is strictly convex at x. 
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5.6 Extremal points of a function; stationary points 
Extremum values and extremum points (local or global) in several variables are 
defined in analogy to dimension one. 

Definition 5.22 A point x € domf is a relative (or local) maximum 


point for f if, on a neighbourhood B,(ao) of xo, 


Va € B,(xo) dom f, HCD) Ss Gag) 


The value f(ao) is a relative maximum of f. 


Moreover, Zo is said an absolute, or global, maximum point for f if 
Vax € dom i f(x) = f(2o), 


and correspondingly f (ao) is the absolute maximum of f. Either way, the 
maximum is strict if f(x) < f(ao) when x F ao. 


Inverting the inequalities defines relative and absolute minimum points. 
Minimum and maximum points alike are called extrema, or extremum points, 


for f. 


Examples 5.23 


i) The map f(x) = ||x|| has a strict global minimum at the origin, for f(0) = 0 
and f(a) > 0 for any x 4 0. Clearly f has no maximum points (neither relative, 
nor absolute) on R”. 

ii) The function f(z, y) = a2(e"¥ — 1) is always < 0, since x? > 0 and ev <1 
for all (x,y) € R*. Moreover, it vanishes if z = 0 or y = 0, ie., f(0,y) = 0 for 
any y € R and f(z,0) = 0 for all  € R. Hence all points on the coordinate axes 
are global maxima (not strict). 


Extremum points as of Definition 5.22 are commonly called unconstrained, be- 
cause the independent variable is “free” to roam the whole domain of the function. 
Later (Sect. 7.3) we will see the notion of “constrained” extremum points, for which 
the independent variable is restricted to a subset of the domain, like a curve or a 
surface. 

A sufficient condition for having extrema in several variables is Weierstrass’s 
Theorem (seen in Vol. I, Thm. 4.31); the proof is completely analogous. 


Theorem 5.24 (Weierstrass) Let f be continuous on a compact set K C 
dom f. Its image f(K) is a closed and bounded subset of R, and in particular, 


f(K) is a closed interval if K is connected. 
Consequently, f has on K a maximum and a minimum value. 
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For example, f(a) = |\x|| admits absolute maximum on every compact set 
Kk CR". 
There is also a notion of critical point for functions of several variables. 


Definition 5.25 A point xo at which f is differentiable is critical or sta- 
tionary for f if Vf(ao) = 0. If, instead, Vf(ao) 40, xo is said a regular 


point for f. 


By (5.8) a stationary point annihilates all directional derivatives of f. 


In two variables stationary points have a neat geometric interpretation. Re- 
calling (5.7), a point is stationary if the tangent plane to the graph is horizontal 
(Fig. 5.6). 

It is Fermat’s Theorem that justifies the interest in finding stationary points; 
we state it below for the several-variable case. 


Theorem 5.26 (Fermat) Let f be differentiable at the extremum point xo. 


Then xo is stationary for f. 


Proof. By assumption the map of one variable 
Le> Fi Rois see) U0,i-1, 7, VOji4+1,---, tity) 


is, for any 7, defined and differentiable on a neighbourhood of xo,;; the latter 
is an extremum point. Thus, Fermat’s Theorem for one-variable functions 


gives ee =. 
Ox 


a 


Pi 


P» Ps 
Sa ee SOFSSS 
Py 


Figure 5.6. Tangent planes at stationary points in two variables 
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In the light of this result, we make a few observations on the relationship 
between extrema and stationary points. 


i) Having an extremum point for f does not mean f is differentiable at it, nor 
that the point is stationary. This is exactly what happens to f(x) = ||a|| of Ex- 
ample 5.23 i): the origin is an absolute minimum, but f does not admit partial 
derivatives there. In fact f behaves, along each direction, as the absolute value, 
fOt yf (Oncnes Oia Dees 0) lal. 


ii) For maps that are differentiable on the whole domain, Fermat’s Theorem 
provides a necessary condition for an extremum point; this means extremum points 
are to be found among stationary points. 


iii) That said, not all stationary points are extrema. Consider f(x,y) = x3 y?: the 
origin is stationary, yet neither a maximum nor a minimum point. This map is 
zero along the axes in fact, positive in the first and third quadrants, negative in 
the others. 


Taking all this into consideration, it makes sense to search for sufficient con- 
ditions ensuring a stationary point xo is an extremum point. Apropos which, the 
Hessian matrix Hf (a) is useful when the function f is at least C? around a9. With 
these assumptions and the fact that xo is stationary, we have Taylor’s expansion 
(5.16) 

f(x) — f(@o) = Q(@ — 20) + o(||a—xol|?), 2 a0, (5.17) 


where Q(v) = 3v- Hf(ao)v is the quadratic form associated to the symmetric 
matrix Hf (ao) (see Sect. 4.2). 
Now we do have a sufficient condition for a stationary point to be extremal. 


Theorem 5.27 Let f be C? on some neighbourhood of xo, a stationary point 
for f. Then 


i) if Zo is a minimum (respectively, maximum) point for f, Hf(ao) is pos- 


itive (negative) semi-definite; 
ii) if Hf (xo) is positive (negative) definite, the point xo is a local strict min- 
imum (maximum) for f. 


Proof. i) To fix ideas let us suppose ao is a local minimum for f, and B,.(ao) is a 
neighbourhood of ao where f(x) > f(ao). Choosing an arbitrary v € R”, 
let = a + ev with ¢ > 0 small enough so that a2 € B,(ao). From (5.17), 


Q(x — x0) + o(||a — aol|?) > 0, ro. 


But Q(x—2x0) = Q(ev) = e7Q(v), and o( || — xol|?) o(e?||v||?) = ¢0(1) 
as € + 0*. Therefore 


oO) +2001) 20; e+, 
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1.€., 
Q(v) + 0(1) > 0, e>0t. 

Taking the limit « — 0* and noting Q(v) does not depend on e, we get 

Q(v) > 0. But as v is arbitrary, Hf(axo) is positive semi-definite. 

ii) Let Hf (a) be positive definite. Then Q(v) > allv||? for any v € R”, 

where a = X,./2 and X, > 0 denotes the smallest eigenvalue of Hf (xo) 

(see (4.18)). By (5.17), 
f(x) — f(xo) 


a||x — xo||? + lla — aol|7o(1) 
= (a + 0(1)) |x — xo”, x Xo. 


IV 


On a neighbourhood B,.(a) of sufficiently small radius, a+o(1) > 0 hence 
f(x) = f (x0). O 


As corollary of the theorem, on a neighbourhood of a minimum point xo for 
f, where Hf(a) is positive definite, the graph of f is well approximated by the 
quadratic map g(a) = f(a) + Q(x — ao), an elliptic paraboloid in dimension 2. 
Furthermore, the level sets are approximated by the level sets of Q(a — a0); as we 
recalled in Sect. 4.2, these are ellipses (in dimension 2) or ellipsoids (in dimension 
3) centred at Xo. 


Remark 5.28 One could prove that if f is C? on its domain and Hf (a) every- 
where positive (or negative) definite, then f admits at most one stationary point 
Zo, which is also a global minimum (maximum) for f. Oo 


Examples 5.29 
i) Consider 


f(z,y) = dre (@* +9") 
on R? and compute 


am y) =2(1- 2x2 \e7(#* +9") , Sela, y) = _Agye +4") 
The zeroes of these expressions are the stationary points x, = (2,0) and 
ty = (— wee 0). Moreover, 
Le, y) = 4x(2x? — 3)e7(@ +H") 
oF e.y) = sf, y) = 4y(2x? — Le (2? +9”) . 
oO? f 


Aap (ty) = Aa(2y? — Ye, 
y 
sO 


—Ar/2e71/2 0 
Hf (9) = ( ‘ Bee 
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U4 


Figure 5.7. Graph and level curves of f(x,y) = Qre7 (2 +4") 


and 
4/3 
np(— 2,0) = 4y/2¢e a . 
2 0 2/2e-1/2 


The Hessian matrices are diagonal, whence Hf(a,) is negative definite while 
Hf(x2) positive definite. In summary, #1 and 22 are local extrema (a local 
maximum and a local minimum, respectively). Fig. 5.7 shows the graph and the 
level curves of f. 


ii) The function 
1 1 
Ja?) = —+y +422 
2 z 
is defined on dom f = {(z,y,z) € R?: 2 #0 and z 40}. As 
1 1 
Vi (x,y, 2) = (-4+#2-5+2) ’ 


imposing V f(x,y, z) = 0 produces only one stationary point a = (1,0,1). Then 


2/x2 0 1 20 1 
Ae, Y, z) = 0 2 0 5 whence Af(1, 0, 1) = 0 2 0 
1 0 2/23 1 0 2 


The characteristic equation of A = Hf(1,0,1) reads 

det(A — AI) = (2- X)((2 — i)? - 1) = (), 
solved by A1 = 1, A2 = 2, A3 = 3. Therefore the Hessian at a9 is positive definite, 
making zo a local minimum point. 


5.6.1 Saddle points 


Recalling what an indefinite matrix is (see Sect. 4.2), statement 7) of Theorem 5.27 
may be formulated in the following equivalent way. 
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Proposition 5.30 Let f be of class C? around a stationary point xo. If 


Hf (ao) is indefinite, xo is not an extremum point. 


A stationary point xo for f such that Hf(ao) is indefinite is called a saddle 
point. The name stems from the shape of the graph in the following example 
around Zo. 


Example 5.31 


Consider f(x,y) = x? — y?: as Vf (x, y) = (2x, —2y), there is only one stationary 
point, the origin. The Hessian matrix 


Af (x,y) = ( =) 


is independent of the point, hence indefinite. Therefore the origin is a saddle 
point. 

It is convenient to consider in more detail the behaviour of f around such a 
point. Moving along the z-axis, the map f(z,0) = 2? has a minimum at the 
origin. Along the y-axis, by contrast, the function f(0,y) = —y? has a maximum 
at the origin: 


F(0, 0) = min f(x, 0) = max f(0,y). 


Level curves and the graph (from two viewpoints) of the function f are shown 
in Fig. 5.8 and 5.9. 


The kind of behaviour just described is typical of stationary points at which 
the Hessian matrix is indefinite and non-singular (the eigenvalues are non-zero and 
have different signs). Let us see more examples. 


UA 


ee, 


an 


Figure 5.8. Level curves of f(x,y) = 2? — y? 


180 5 Differential calculus for scalar functions 


Figure 5.9. Graphical view of the function f(x,y) = x? — y? from different angles 


Examples 5.32 
i) For the map f(x,y) = ry we have 
0 1 
Vilty)=(y2) and Af(2,y) = () ,) 
As before, #9 = (0,0) is a saddle point, because the eigenvalues of the Hessian are 
Ai = 1 and Ag = —1. Moving along the bisectrix of the first and third quadrant, 
f has a minimum at xo, while along the orthogonal bisectrix f has a maximum 
at Xo: 
F(0,0) = min f(a, x) = max f(x, —2). 
The directions of these lines are those of the eigenvectors w; = (1,1), we = 
(—1,1) associated to the eigenvalues \1, A2. 


Changing variables x = u— v, y= u+t v (corresponding to a rotation of 7/4 in 
the plane, see Sect. 6.6 and Example 6.31 in particular), f becomes 


f(x,y) = (u—v)(ut v) =a —v? _ f(u,v), 
the same as in the previous example in the new variables uw, v. 


ii) The function f(z, y, z) = 2? +y?—2z? has gradient Vf(z, y, z) = (22, 2y, —2z), 
with unique stationary point the origin. The matrix 


20 6 
0 0 -2 


is indefinite, and the origin is a saddle point. 
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A closer look uncovers the hidden structure of the saddle point. Moving on the 
ry-plane, we see that the origin is a minimum point for f(x, y,0) = 2? + y?. At 
the same time, along the z-axis the origin is a maximum for f(0,0,z) = —2?. 
Thus 
0,0,0) = min x,y, 0) = max f(0,0,2). 

F(0,0,0) = _ min, f(2,,0) = max F(0,0,2) 
iii) A slightly more elaborate situation is provided by the function f(x,y, z) = 
x? + y? — 27. Since Vf (x,y,z) = (2x, 3y”, —-2z) the origin is again stationary. 
The Hessian 


20 0 
Hf(0,0,0)=}]0 0 0 
GH 


is indefinite, making the origin a saddle point. More precisely, (0,0,0) is a min- 
imum if we move along the x-axis, a maximum along the z-axis, but on the y-axis 
we have an inflection point. 


The notion of saddle points can be generalised to subsume stationary points 
where the Hessian matrix is (positive or negative) semi-definite. In such a case, the 
mere knowledge of Hf(ao) is not sufficient to determine the nature of the point. 
In order to have a genuine saddle point 2) one must additionally require that there 
exist a direction along which f has a maximum point at xo and a direction along 
which f has a minimum at xo. Precisely, there should be vectors v1, v2 such that 
the maps t> f(ao +tv;), 7 = 1,2, have a strict minimum and a strict maximum 
point respectively, for t = 0. 

As an example take f(a, y) = x? —y*. The origin is stationary, and the Hessian 


0 
at that point is Hf(0,0) = « of hence positive semi-definite. Since f(x,0) = 


—y* has a maximum at y = 0, the above 


x? has a minimum at « = 0 and f(0,y) = 
= (1,0), ve = 7 = (0,1). We still call ap a 


requirement holds by taking v, = 7 
saddle point. 

Consider now f(x,y) = «?—y?, for which the origin is stationary the Hessian is 
the same as in the previous case. Despite this, for any m € R the map f(x,ma) = 
x? — m3x° has a minimum at x = 0, and f(0,y) = —y° has an inflection point at 
y = 0. Therefore no vector v2 exists that maximises t > f(tv2) at t = 0. For this 
reason Zo = O will not be called a saddle point. 

We note that if xo is stationary and Af (ax) is positive semi-definite, then every 
eigenvector w associated to an eigenvalue \ > 0 ensures that t +> f(a + tw) has 
a strict minimum point at t = 0 (whence w can be chosen as vector v1); in fact, 
(5.17) gives 


f(a + tw) = f(wo) + 5Alhwll*e? +02), 40. 
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Then 2 is a saddle point if and only if we can find a vector vg in the kernel of 
the matrix Hf(ao) such that t H f(a + tv2) has a strict maximum at t = 0. 
Similarly when Hf (ao) is negative semi-definite. 


To conclude the chapter, we wish to provide the reader with a simple procedure 
for classifying stationary points in two variables. The whole point is to determine 
the eigenvalues’ sign in the Hessian matrix, which can be done without effort for 
2 x 2 matrices. If A is a symmetric matrix of order 2, the determinant is the 
product of the two eigenvalues. Therefore if det A > 0 the eigenvalues have the 
same sign and A is definite, positive or negative according to the sign of either 
diagonal term aj or @22. If det A < 0, the eigenvalues have different sign, making 
A indefinite. If det A = 0, the matrix is semi-definite, not definite. 

Recycling this discussion with the Hessian Hf(ao) of a stationary point, and 
bearing in mind Theorem 5.27, gives 


xo strict local minimum point, if frx(ao) > 0 


det Hf(a%p) >0 => 


xo strict local maximum point, if fr2(ao) < 0 
det Hf (ao) <0 = Zo saddle point 


det Hf(xo) =0 => the nature of a cannot be determined by Hf (ao) only. 


In the first case det Hf(ao) > 0, the dichotomy maximum vs. minimum may be 
sorted out by the sign of fy,(ao), which is also the sign of fr2(a0o). 


Example 5.33 


The first derivatives of f(x,y) = 2ry + e (+) are 
2 2 
fe(t,y) =2(y—(e@ty)e"™), — fy(z,y) =2(@-(a@ type") ), 
and the second derivatives read 
fea(Z, y) = fuy(z,y) = —2e ety) (1 a 2(x a y)?) ’ 


fey(Z, y) = ace y) =2— Qe—(etu)" (1 -_ 2(x za y)*) . 
There are three stationary points 


1 1 
xo = (0,0), Ly, = (Sv log 2, 5 Vlog 2) , Y= -21, 


and correspondingly, 
ta 6 _ _ (1—2log2 14+2log2 
Hj(eo) = ( 0 = Hie) = Hi(es) = (17 ae ene) 


Therefore ap is a maximum point, for Hf(axo) is negative definite, while 2; and 
Z2 are saddle points since det Hf (x1) = det Hf (a2) = —8log2 < 0. 
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5.7 Exercises 


1. Compute the first partial derivatives at the points indicated: 


a) f(a,y) = V3r+y? at (xo, yo) = (1, 2) 
b) f(e,y,z) = ye" at (Zo, yo, 20) = (0,1, -1) 


y 
f(2,y) = 80? + i edt at (vo, yo) = (3,1) 
1 
2. Compute the first partial derivatives of: 


f(x,y) = log(a + V/x? + y?) 
f(e,y) = | Ca 
Flay,2t) = 2—Y 


f(v1,..-,%n) = sin(a1 + 242 +...+ nan) 


3. Compute the partial derivative indicated: 


a) ie, y) = ay? — Bry" , fuyy 
. af 
b) f(#,y) =asiny, aay 
c) f(z, Y; z) = @4? ’ Frey 
oF 
-_ abe 
d) faGresp Zz) =X y z y) Ox Oy20z3 


4. Determine which maps f satisfy fry + fy, = 0, known as Laplace equation: 
a) f(a,y)=a? +y? b) f(a,y) =a? + 32y? 
f(z,y) = log V2? + y? f(x,y) =e *cosy—e %cosz 
1 
Check that f(x,t) = e~' sin kz satisfies the so-called heat equation f,; = pate: 


6. Check that the following maps solve fi: = fr, known as wave equation: 


f(a, t) =sine sint f(x, t) = sin(# — t) + log(z + t) 
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Given 
yey F(2,y) # (0,0) 
L x, 2 ? 
fay =< e+e i 


10. 


I 


0 if (x,y) = (0,0) , 


a) compute f,(x,y) and fy(x,y) for any (x,y) € (0,0); 
b) calculate fr(0,0), fy(0,0) using the definition of second partial derivative; 
c) discuss the results obtained in the light of Schwarz’s Theorem 5.17. 


. Determine the gradient map of: 


a) F(e,y) = arctan! b) f(e.u) = (+ y) log (2a — y) 


c) f(x,y, z) = sin(a + y) cos(y — z) d) f(z,y,z) =(a@+y)” 


. Compute the directional derivatives along v, at the indicated points: 


a) f(x,y) =a2/y-3 v = (-1,6) xg = (2,12) 
b) Car z) = <= v= (2. —9, —4) Lo = Cl, 1, —1) 


Determine the tangent plane to the graph of f(x,y) at the point Py = 
(xo, Yo, f (®o, Yo): 


a) f(x,y) = 3x? — y? + 3y at Py = (-1,2, f(—1,2)) 


f(x,y) = ee at Po = ie i, ogee 1) 
c) f(x,y) =axlogy at Po = (4,1, (4, 1)) 


Relying on the definition, check the maps below are differentiable at the given 
point: 


f(a,y) =yva at (20, yo) = (4,1) 
f(x,y) = |yllog( +a) at (x0, yo) = (0, 0) 
f(2,y) = ay — 32? at (xo, yo) = (1, 2) 
Given 

one Bp Fey) 4 (0.0), 


0 if (x,y) = (0,0), 
compute f,(0,0) and f,(0,0). Is f differentiable at the origin? 


13. 


15. 


18. 


19.) 
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Study the differentiability at (0,0) of 


gy? 


f(e%,y) =< at + yt if (x,y) 4 (0,0), 
0 if (ay) = (0,0). 


Discuss the differentiability of 
f(x,y) = |a|sin(w? + y’) 
at any point of the plane. 


Study continuity and differentiability at the origin of 


* si 1 0 


as a varies in the reals. 


Study the differentiability at (0,0,0) of 


1 

7: 3 2) sin ———____ 
Ct (x+y +2°)s ae if (t,y, 2) # (0,0,0), 
: if (x,y, z) aa (0, 0, 0). 


Given f(x,y) = x? + 3xry — y”, find its differential at (xo, yo) = (2,3). If x 
varies between 2 and 2.05, and y between 3 and 2.96, compare the increment 
Af with the corresponding differential df ao ,o)- 


Determine the differential at a generic point (0, yo) of the functions: 


f(x,y) =e* cosy b) jay) =asmay 
c) f(x,y, 2) = log(a? + y? + 27) d) f(#,y,2) = ESE 


Find the differential of f(x,y, z) = x2°\/y2 + 22 at (2,3,4) and then use it to 
approximate the number 1.98? \/3.012 + 3.972. 


Tell whether 


1 
LZ, tF__— 
F(a, y) z+yt+l 


is Lipschitz over the rectangle R = |0,2] x [0,1]; if yes compute the Lipschitz 
constant. 
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21. Determine if, how and where in R® is 
f(x,y, z) = e7 (827 +2y*+2°) 
a Lipschitz map. 


22. Write the Taylor polynomial of order two for the following functions at the 
point given: 


f(x,y) = cosa cosy at (0, yo) = (0,0) 
b) F (234; z) Serra at (Xo, Yo; 20) = (1,1, 1) 
c) f(x,y, z) = cos(% + 2y — 3z) at (Xo, yo, Zo) = (0,0, 1) 


23. Determine the stationary points (if existent), specifying their nature: 


la) | f(e,y) = 2?y +0? — 2y f(x,y) = ylog(a + y) 

[e)| flay) =0t 50° + Ply? 1) f(x,y) =axye 27% 

[)] fey) =F +2 -y F(t,y) = 2ylog(2 22) + y? 
|2)| f(asy) = <8 eve f(x,y) = yloge 

f(x,y) = loge? + y? = 1) Fle,y,2) = aye +o + = 


Determine and draw the domain of 
f(,y) = Vy? — 2. 
Find stationary points and extrema. 
Determine domain and stationary points of 
f(z,y) = x7 log(1 + y) +2*y?. 


What is their nature? 


5.7.1 Solutions 


1. Partial derivatives of maps: 

Of 3 Of 2 
o. HO. i ea 5 
) Da | ) 2/7 By | ) V7 
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Of _1 Of Of 4 
b) Fo ,—1) ay 1 =0, De (0,1,-1) =e 
c) We have 
Of _ of See 
Da (2.9) = 16% and By oY) =e ; 


the latter computed by means of the Fundamental Theorem of Integral Cal- 


culus. Thus 
Of 


Fig1y=48 and (3,1) =e. 
Oy 


2. Partial derivatives: 


a) We have 
1 


l y 
f.(2,y) = ==>, Fa pS 
ou) aan YP me, cape Or ye «fae a 


b) Using the Fundamental Theorem of Integral Calculus we have 


foley) =—— | cost? dt = — cos 2”, 
Of: J, 


ary 
fyley) = 5 f cos t? dt = cosy”. 


c) We have 
1 1 
L\4, YG; su = 999") = ’ 
fA et) a Leelee p= 
Ye ur—y 
z af) »t = ’ 9S) »t — « 
fulz,y, z ) (z — t)? filz,y, 2 ) (z —t)? 
d) We have 
Of 
Dy, els stn) = cos(ai + 22+... + Mtn), 
1 
Of 
Fea, Ca 2 eae nie En) 
Of 
Dy Ely tn) = neos(a1 + 2a2+...+ Nan), . 


3. Partial derivatives: 


a) ding ==: by je Sings C) feys = yz7e"¥* (2+ xyz). 


d) Sexviye = abc(b 1)(c 1)(c 2g hye ; 
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4. Solutions of the Laplace equation: 


a) No. b) No. 
c) Since f(x,y) = 4 log(z? + y?), 
2 2 
fe fea = TEBE 
y oy? 
VS EP fw GEA 


hence frre + fyy = 0, Vx,y # 0. Therefore the function is a solution of the 
Laplace equation on R? \ {0}. 


d) Since 
fe =—e *cosyt+e Ysinz, fea =e *cosyte cosa, 
fy=—e “siny+e % cosa, lun = He “cosy —e * cosz, 


we have fre + fyy = 0, V(z,y) € R’, and the function satisfies Laplace’s 
equation on R?. 


5. The assertion follows from 


fi = —e ‘sinkz, fr = ke ' coska, fon = —k?e7* sin ker. 


6. Solutions of the wave equation: 


a) From 
fe =cosx sint, fer = —sinz sint, 
fi = sin zx cost, fie = —sinaz sint 
follows fre = fiz, V(a,t) € R?. 
b) As 
fe = cos(a —t) + : : ee 
a+t (x +t)? 
1 il 
fr = — coal —t) + ae cam ame corer 


we have fro = fix, V(z,t) € R? such that x +t > 0. 


7. a) Using the usual rules, for (x, y) 4 (0,0) we have 
rn (3a?y — y®)(x? + y?) — 2a(a?y — ey?) _ y(at + 4a?y? — y*) 
a\t, (x? me y2)? (x? te y?)? a 


pe eS) eae 
ues (a? + y?)? Gaye 


5.7 Exercises 
b) To compute f,(0,0) and f,(0,0) let us resort to the definition: 


Je, 0) ~~ f (0, 0) 


fc(0,0) = lim - = lim 0=0, 
Pd) iin DSTO) 52, 
y0 Yy y—0 
Moreover, 
al _ f2(0,¥) — fr(0, 0) y? 
fry(0,0) Oyo ae y ~ y0 yd =a 
_ OF _,. fy(x,0) — fy(0, 0) x 

F200) = Fray) = Be = Litt 
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c) Schwarz’s Theorem 5.17 does not apply, as the partial derivatives fry, fyx are 
not continuous at (0,0) (polar coordinates show that the limits Me . fog ey) 


“L,Y => 
and lim z(x,y) do not exist). 
(.y)—+(0,0) Fya(@,9) ) 


8. Gradients: 


c) Vile, Y, z) = 
d) Vf(a,y,z) = (e(a@t+y)**, e(a@t+y)*"*, (cx +y)* log(z + y)) . 


9. Directional derivatives: 


Of Of i 

(gy) =i, ae. 
a) (a0) ) (eo) =-2 
10. Tangent planes: 
a) z=—6x%—y+1. 
b) By (5.7), we compute 

2 2 2 2 
fx(z,y) = —2xe" ~* , fy(x,y) = 2ye* * , 


f-Liet, Wee s, Fie: 
The equation is thus 
z= f(-1,1) + fe(-1, 1) (@ +1) + fy(-1, Dy — 1) = 2a + 2y 4-1. 


c) z=4y-4. 


(cos(a + y) cos(y — z), cos(a + 2y — z), sin(x + y) sin(y — z)) . 
( 
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11. Checking differentiability: 

a) From f(a; y) = ONE and f,(x,y) = Vz follows 
1 

4 $ 
Then f is differentiable at (4,1) if and only if 


f(x,y) — F(4, 1) = fe(4, I)(@ = 4) = fyl4, DY =D) 


f(4,1) =2, fr(4, 1) = fy(4, 1) =2. 


lim =0 
(x,y) (4,1) (a — 4)? + (y—1)? 
L€., 
2—4 4)—2 1 
pe im Be g(@—4)—- 2-1) _, 
(x,y) (4,1) (2 — 4)? + (y— 1) 
Put c =4+rcoséey=1+4rsiné@ and observe that for r > 0, 
V44+ rcosé = 24/1 + = cos 6 =2 (1+ = 0s + o(r)) ; 
so that 
1 
yV@—2— =(@— 4) — 2y—1) = 21 + sind) (1 4 = cos + 0(r) + 
1 
—2— 7rcos@ — 2rsin# = o(r), r—>0. 
Hence 
L= lim elt) = lim o(1) = 0. 
r>0 7 r—0 


b) Note f(z,0) = f(0,y) = 0, so fr (0,0) = f,(0,0) = 0. Differentiability at the 
origin is the same as proving 
lid Fey) = 7 (0,0) = V7 (0,0) (x,y) 
(x,y)— (0,0) \/ a? + y? 


i.e., given that f(0,0) = 0, 


=(), 


im Yoga +e) _ 
(x,y)>(0,0)  ./a? + y? 


Polar coordinates come to the rescue: 


log(1 in 8 log(1 
UE) _ en Plo P08) 25 iad cose oie 
[2 + y? r 


Proposition 4.28 with g(r) = 2r allows to conclude. 
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c) We have f(1,2) = —1, f2(1,2) = —4 and f,(1,2) = 1; f is differentiable at 
(1,2) precisely when 


cy — 327 +1+4(2—1)—- (y— 2) 


L= lim =0. 
(x,y) (1,2) (a — 1)? + (y — 2)? 
Setting x =1+rcosé, y=2+rsin@ and computing, 


L = lim rcos 6(sin 6 — 3cos 6). 
r—>0 


The limit is zero by Proposition 4.28 with g(r) = 4r. 


12. Observe f(zx,0) = f(0,y) = 0, so 
f(z, 0) = f(0, 0) 


a ae a a a 
0,y) — f (0,0 
fy(0,0) = lim F(0,y) — F00,0) _ ling =O. 
yO Yy yO 


The map is certainly not differentiable at the origin because it is not even con- 


tinuous; infact lim _ f(x,y) does not exist, as one sees taking the limit along 
(x,y)— (0,0) 


the coordinate axes, 
lim f(x,0) = lim f(0,y) =0, 
x0 y—0 


and then along the line y = a, 
lim f(x, 2) = : 
oo, Lyx) = 5 


13. The function is not differentiable. 


14. The map is certainly differentiable at all points (x,y) € R? with x 4 0, by 
Proposition 5.8. To study the points on the axis, let us fix (0, yo) and compute the 
derivatives. No problems arise with f, since 


fy(z,y) = 2lalysin(a* +y*), = V(z,y) ER’, 
and f,(0, yo) = 0. As far as f, is concerned, 
PN i gD 
a (el sin(e? + 98). 


im 1 6&1 ¥o) — FO, Yo) _ 4, 
x20 tb x0 ib 


If yo = +J/nz, with n € N, we have 


f.(0,+Vn7) = lim lel _yyn sina? =0, 


z-0 2X 
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otherwise a 
x 
lim — sin(a? + y2) = sin y2, 
Jim, — sin(2" + yo) Yo 
while 
x 
lim a sin(x” + yg) = —siny?. 
z->0- & 
Thus f,(0, yo) exists only for yo = +/nz, where f is continuous and therefore 
differentiable. 


15. The map is continuous if a > 0; it is differentiable if a > 0. 
16. Observe f(x, 0,0) = x? sin 4, hence f,(0,0,0) = 0, since 


Ja]? 


= 7 (0, 0,0 
DB OHO Gees i a, 


xv x—0 |z| - 


Similarly, f,(0,0,0) = f.(0,0,0) = 0. For differentiability at the origin we consider 


1 
lim _f (e952) = lim fx? + y? + z2 sin ———— . 
(x,y,z)+(0,0,0) \/x?2 + y2 + 22 (#,y,z)-+(0,0,0) \/u2 + y? + 2? 


The limit is zero by the Squeeze rule: 


1 
0 < |x? + y? + 2? sin —————_| < 2? + y? + 2?, 
fu? + y? + 2? 


for all (x, y, z) € (0,0, 0). 
17. Since fz(x, y) = 2¢ + 3y, fy(x,y) = 3a — 2y we have 
df(2,3)(Aa,Ay ) = Vf(2,3) - (@ — 2,y — 3) = 13(@ — 2). 
Let now Aw = (2.05 — xo, 2.96 — yo) = (725, -74q), 80 
Af = f (2.05, 2.96) — f(2,3) = 0.6449 
and 


5 4 


df (2,3) 


18. Differentials: 
a) As fz(z,y) =e* cosy and f,(z,y) = —e* sin y, the differential is 


dca Oey ) = Vf (Xo, yo) - (Av, Ay ) = e”° cos yo Ax — e*° sin yo Ay. 


b) df(«o,yo) (Av, Ay ) = (sin coyo + LoYyo Cos toyo) Ax + x3 cos xoyo Ay. 
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229 2Yo 


d Az, Ay,Az) = ————- A =+—3,—; A 
) Meow ry AeA nde) = apap Aet aay ag et 
2 
+5 + zy Az. 
LH + Yo + 2% 
d) dfiae.an.z)(Av,A y,Az ) = ———— Ax — ———~ Ay — ————; 22. 
) Af(xo,40,20)(Ar,A y,Az ) yo + 229 (yo + 220)? (yo + 220)? / 
19. From 
xy xz 
alL,Y, 2 = 3? 24 22, LLY, 2) Ea > ALY, 2) =e 
fo(a,y, 2) vy fy(x,y, 2) Zee f(x,y, 2) Pa a 
follows 
24 32 
df(2,3,4) (A7,A4 y,Az ) = 60Ar + 5 4 + 5 oe 7 
Set Aw = (— 75, ig —i), 80 that 
2 1 3 
q ge a ah, 
fia. 799° 790" ~ To0 


By linearising, we may approximate 1.98° 3.012 + 3.97? by 


2 1 3 
f(2, 3, 4) =< df(2,3,4) 


(= a5 Too Tipg) = 40 — BS = 38.656. 


20. First of all 

OF ( ) = ory ) = ee ee 
Ox L,Y ~ Oy x,y > ( 
and secondly 


sup A (. y)| = sup (0, y)| =; 
(z,y)ER Ox (x, y)ER Ox 

Proposition 5.16 then tells us f is Lipschitz on R, with L = v2. 

21. The map is Lipschitz on the entire R°. 

22. Taylor polynomials: 

a) We have to find the Taylor polynomial for f at ao of order 2 


T fo,0.(@) = f(@o) + Vf (ao) - (@ — #0) + (2 — xo) Hf(xo)(x — x0). 


Let us begin by computing the partial derivatives involved: 


fr(z,y) = —sinz cosy so f(0,0) =0 
fy(x,y) = —cosxsiny so f,(0,0) =0 
fea (Z,y) = — cosxz cosy so fxe(0,0) = —1 
fuy(@, y) = —cosxcosy so fyy(0,0) =—1 


ia = jie) KH sinwsingy s0° fy (0,0) = fe(0,0) = 0 
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As f(0,0) = 1, 
1 —1 0 os 
T f2,(0,0) (2; y) =1+ 5 (ty) : 0 4 , ij 
1 1 1 
=l+- (= —yyel ==? = 9") 
+ 5 (2.9) -(—2, 9) 5t 5 


Alternatively, we might want to recall that, for x > 0 and y > 0, 


ih 1 
cosa =1— 52" +o(x"), cosy =1— sy" + o(y"); 
which we have to multiply. As the Taylor polynomial is unique, we find imme- 
diately T f2,(0,0) (2, y) =1— $x° = sy”. 


b) Computing directly, 
T foaan(@y2) = 8 (1+ @-1)+(y-+(@-1)+ 
+5 (0 —1)? +(y—1)?+(z-1)”)+ 
+(x —1)(y-1) + (#- 12-1) +(y-1)E-D). 


c) We have 
T f2,(0,0,1) (2, Ys 2) = cos3 + sin 3(a + 2y — 3(z—- 1) ah 


1 
+5 cos 3( xu? — dy? —9(z—1)?) + 
+ cos 3( — 2xy + 6a(z — 1) + 6y(z—1)). 


23. Stationary points and type: 


a) From 
0 2) 
oF (x,y) = 2n(y +1), S(x,y) = 07-2 
and the condition V f(x,y) = 0 we obtain the stationary points P, = (V2, —1), 
P, = (—/2, —1). Since 


ou oF 


Aqz oy) = 2 +1), Bye (x,y) =0, 
OF a7 f 

= = 2 
Buby y) aya y) = 22, 


the Hessians at those points read 


0 2/2 0 —9,/2 
aey=(,),™*). anm=(_2, ™*). 


In either case the determinant is negative, so P,, Pp are saddle points. 
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b) The function is defined for x + y > 0, i-e., on the half-plane dom f = {(z,y) € 
R?:y>—a}. As 


] Of 7] 
eae = tn log(a + y) + 
(x,y) y? By | t) g( y) a 


imposing V f(x,y) = 0 yields the system 


y —4 
ory 
log(x +y) +—— = 
x+y 


whose unique solution is the stationary point P = (1,0) € dom f. We have 
then 


a y can 1 x 
omer) — ’ x, = ; 
ag 4 (x+y)? Oy? ? r+y (x+y)? 
oF a7 x 
——( 7y) = ( ,Y) = 2? 
Ox Oy OyOx ey) 
Se) 
Hf(P)=(7 5) 
ALT 2 
The Hessian determinant is —1 < 0, making P a saddle point. 
c) From 
Of 5 Of 3 
7g ry) = 1 +2, By oY) y” — 2y 
and V f(x,y) = 0 we find three stationary points 
V2 v2 
P, = (-1,0), = (1): = (Slo 
Then 
O° f 4 Of O f O f 2 
—_ = === —_ ne = — = 12y* —-2. 
Ox2 ea y) ox ) andy” y) OyOu (Ce, y) 0. Oy? (a y) Y 
Consequently, 
5 60 V2 J/2 5 0 
Af (—1,0) = Af( —1, —) = Af( -—1,-—) = ; 
r-10=() 8). e(-1%2)=Hr(-1-2)=(5 4) 


The Hessians are diagonal; the one of P; is indefinite, so P; is a saddle point; 
the other two are positive definite, and P2, P3 are local minimum points. 
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d) The partial derivatives read 


- (a,y) = (1-2) e568, Say) = (1-2) e578, 


The equation V f(x,y) = 0 implies 


¢ 
1- =) =0 
a ( 5 
y 
1-2) =0, 
. ( 6 
hence P; = (0,0) and P; = (5,6) are the stationary points. Now, 


TS (ey) = ty (2-2) 04, 


Ox? 5° \5 
a? f a? f r OY ga 

= a (fp fiiee hess 
Aaa!) = Bpaae) (1-2) (0-Z)e# 4, 
Or 1 Yy _a2_y 
agri) = ge (G -2)2 ae 

so that 

0 1 ale 4 0 
Hf (0,0) = Hf(5,6)=( > 
10.0=(4 g), HGH=(“) go] 


Since det Hf (0,0) = —1 < 0, P; is a saddle point for f, while det Hf (5,6) = 
2 


e~4 > 0 and —(5,6) < 0 mean P is a relative maximum point. This last fact 


could also have been established by noticing both eigenvectors are negative, 
so the matrix is negative definite. 


e) The map is defined on the plane minus the coordinate axes x = 0, y= 0. As 


Of 1 8 Of x 
— —— —— —— 1 
V f(x,y) = 0 has one solution P = (—4,2) only. What this point is is readily 
said, for 
O7f 16 orf 0? f 1 any 2x 


Bg2 Oy) = a> Daag OY) = Byag OY) = =; Bz ew) = 


and 


: 3 oF 1 : 
Since det Hf(—4,2) = 16 > 0 and —>(-4,2) = 4 = 0, P is a relative 


maximum. 
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f) The function is defined on 


dom f = {(z,y) € R?:2-—27?>0}, 


which is the horizontal strip between the lines y = +2. From 


Of Ary Of 9 

Fg (bY) = ZF? Dy (x,y) = 2log(2 — x*) + 2y, 

equation V f(x,y) = 0 gives points P, = (1,0), P2 = (—1,0), P3 = (0, —log2). 
The second derivatives read: 


oF Ay(a? +2 GF 
sate) =P She) =2. 
6? f _ Of de 
Daag oY) _ Dyan oY) ae 
so 
0 —-4 0 4 
HfLQ=(", >) HCLO=(4 3). 
_ [2log2 0 
Af (0, —log2) = ( ‘ >) 


As det Hf (1,0) = det Hf(—1,0) = —16 < 0, Py and P» are saddle points; P3 
is a relative minimum because the Hessian Hf(P3) is positive definite. 


e) Using 
Of = 3a? —6ary+2y? Of = 2 3x7 —6ryt+2y? 
Fg try) _ 6(x y)e ’ dy (x, y) — 6(y axe ’ 


V f(x,y) = 0 produces two stationary points P; = (0,0), P2 = (1,1). As for 
second-order derivatives, 


; 2 
oF (a, y) = 6(1 + 6(a — y)?) 3” Set 20" 
O? 62 _ 
aay” = man y) = 6(—1+4 6(y? — 2)(a — ye “Sevt 
: 2 
- (x,y) = 6(2y + 6(y? — x)?)e3* —Or9 420" 
sO 


6 —6 6e-> = —6e7} 
Af (0, 0) = (", 0 ) ’ Af(1, 1) = Ge 12e—1 ) : 


The first is a saddle point, for det Hf (0,0) = —36 < 0; as for the other point, 
2 


a) 
det Hf (1,1) = 36e7! > 0 and a 1) = 6e | > 0 imply Pp» is a relative 
Hi 


minimum. 
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h) The function is defined on x > 0. The derivatives 


Of _y Of 7 
5g ry) = o> By woe) — Pee 


are zero at the point P = (1,0). Since 


a7), (a2 Seis jae ee aa = 
Ox2 oY 2? dx dy oY yOu oY ge Oy? iY ’ 
we have 
Hy(o=(" ¢ 
a oe es ia 


and det Hf (1,0) = —1 <0, telling that P is a saddle point. 
i) There are no stationary points because 


20 2y ) 


View) = (ayo wa sd 


vanishes only at (0,0), which does not belong to the domain of f; in fact 
dom f = {(z,y) € R? : x7 + y? > 1} consists of the points lying outside the 
unit circle centred in the origin. 


() Putting 


1 1 Of 1 


Of _ 
ae? By ere) ae Bz hr ¥r2) = ty — 


i ) 

a =_ Ze 
aa L,Y, % Y 
all equal 0 gives P; = (1,1,1) and P, = —P,. Moreover, 


2 2 2 
Fone: y) _ i fuy(ey) = yp’ fz(2, y) = 23 


gl = je = 2 it) aie =U: phe) =fa=o, 


so the Hessians Hf(P,), Hf(P2) are positive definite (the eigenvalues are 
Ai = 1, with multiplicity 2, and Ay = 4 in both cases). Therefore P; and P» 
are local minima for f. 


24. The domain reads 


dom f = {(a,y) € R? : y* — 2? > 0}. 


The inequality y? — x? > 0 is (y — x)(y+x) > 0, satisfied if the factors (y — 2), 
(y+) have the same sign. Thus 


dom f = {(z,y) € RR? :y >a and y > —a}U {(z,y) ER? :y <a andy < —2}, 


see Fig. 5.10. 
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Figure 5.10. Domain of f(z, y) = V/y? — x? 


The first derivatives are 


Of xe Of y 
r) (x,y) — ’ (x,y) = : 
v /y? — x Oy /y2 — x 
The lines y = x, y = —z are not contained in the domain of the partial derivatives, 


preventing the possibility of having stationary points. 
On the other hand it is easy to see that 


fee) =f (4-2) =0 and f(x,y) >0, V(x, y) € dom f. 


All points on y = xz and y = —2, ie., those of coordinates (x, x) and (x, —2), are 
therefore absolute minima for f. 


25. First of all the function is defined on 
dom f = {(z,y) € R?:1+y> 0}, 


which is the open half-plane determined by the line y = —1. 
Secondly, the points annihilating the gradient function 


6) 0 2 
vite) = (Flea), Sheu) = (2etog( +0) +202, = 


+ 20) 
] 


are the solutions of ; 
2x (log(1 + y)+y°) =0 


1 

2 

x —+2 —0. 
(= v) 
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We find (0, y), with y > —1 arbitrary. Thirdly, we compute the second derivatives 
Oy 2 
Aya (ty) =2 (log(1+y)+y*) , 
0? f 0? f 1 
—— =— =2 — +2 
Daag oY) yon Y) x roy ale 


These tell that the Hessian at the stationary points is 


2 (log(1 + y) + y’) 5) 


Af(0,y) = ( ; ; 


which unfortunately does not help to understand the points’ nature. 

This can be accomplished by direct inspection of the function. Write f(x,y) = 
a(x)8(y) with a(x) = x? and 8(y) = log(1+y)+y?. Note also that f(0,y) = 0, for 
any y > —1. It is not difficult to see that 6(y) > 0 when y > 0, and 6(y) < 0 when 
y < 0 (just compare the graphs of the elementary functions y(y) = log(1 + y) and 
w(y) = —y?). For any (2, y) in a suitable neighbourhood of (0, y) then, 


f(x,y) > 0 if y>O and f(x,y) <0 if y<0O. 


In conclusion, for y > 0 the points (0,y) are relative minima, whereas for y < 0 
they are relative maxima. The origin is neither. 
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Differential calculus for vector-valued functions 


In resuming the study of vector-valued functions started in Chapter 4, we begin 
by the various definitions concerning differentiability, and introduce the Jacobian 
matrix, which gathers the gradients of the function’s components, and the basic 
differential operators of order one and two. Then we will present the tools of differ- 
ential calculus; among them, the so-called chain rule for differentiating composite 
maps has a prominent role, for it lies at the core of the idea of coordinate-system 
changes. After discussing the general theory, we examine in detail the special, 
but of the foremost importance, frame systems of polar, cylindrical, and spherical 
coordinates. 

The second half of the chapter devotes itself to regular, or piecewise-regular, 
curves and surfaces, from a differential point of view. The analytical approach, 
that focuses on the functions, gradually gives way to the geometrically-intrinsic 
aspects of curves and surfaces as objects in the plane or in space. The fundamental 
vectors of a curve (the tangent, normal, binormal vectors and the curvature) are 
defined, and we show how to choose one of the two possible orientations of a curve. 
For surfaces we introduce the normal vector and the tangent plane, then discuss 
the possibility of fixing a way to cross the surface, which leads to the dichotomy 
between orientable and non-orientable surfaces, plus the notions of boundary and 
closed surface. All this will be the basis upon which to build, in Chapter 9, an 
integral calculus on curves and surfaces, and to establish the paramount Theorems 
of Gauss, Green and Stokes. 


6.1 Partial derivatives and Jacobian matrix 


Given x 9 € dom f, suppose every component f; of f admits at xo all first partial 
derivatives aa j=l1,...,n, so that to have the gradient vector 
Lj 
Of; Ofi Ofi 
Vifile = x0) = ( Di lixaus xo) 
fi(@o) On; (20) 1<j<n Ox (20) OF (20) 


here written as row vector. 
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Definition 6.1 The matrix with m rows and n columns 


is called Jacobian matrix of f at the point xo. 


The Jacobian (matrix) is also indicated by Jy(ao) or Df (xo). 
In particular if f = f is a scalar map (m = 1), then 


Jf (xo) = Vf (0) - 


Examples 6.2 
i) The function f : R? > R?, f(x,y, z) = cyzit+ (a?+y?4+27) 7 has components 
filx,y, z) = xyz and fo(a, y, z) = 22+y?+27, whose partial derivatives we write 
as entries of 
ye we wy 
JTC. 2) = : 
F(2,y,2) 2y | 


ii) Consider 


f:R° SR’, f(x) =Ax+b, 
where A = (a;;)1<i<m € R™*” is an m x n matrix and b = (bj )1<i<m € R™. 
1<j<n 
The ith component of f is 


fila) = SO aigay +i, 
j=l 
Ofi 


uy 


whence (a) = a, for all j =1,...,n 2 € R”. Therefore Jf(x) = A. 


6.2 Differentiability and Lipschitz functions 


Now we shall see if and how the previous chapter’s results extend to vector-valued 
functions. Starting from differentiability, let us suppose each component of f is 
differentiable at x € dom f (see Definition 5.5), 


fil) = filo) + Vfi(xo) - (@ — #0) + ole — xol|),  & > o 


for any i = 1,...,n. The dot product V f;(ao) - (a — ao) is to be thought of as a 
matrix product between the row vector V f;(ao) and the column vector x — xo. 
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By definition of Jacobian matrix, we may write, vectorially, 


f(x) = f(xo) + If (xo)(# — xo) + of||a — woll), (6.1) 


One says then f is differentiable at xo. 
Let us consider a special case: putting Ax = x — ao, we rewrite the above as 


f(xo + Ax) = f(xo) + Jf (x0) Ax + o(|| Axl) , Az — 0; 
the linear map df, from R” to R™ defined by 


dfa,: Av Jf (xo) Ax 


is called differential of f at ao. Up to infinitesimals of order greater than one, 
the formula says the increment Af = f(a + Ax) — f(a) is approximated by the 
value of the differential df, = Jf (xo) Az. 


As for scalar functions, equation (6.1) linearises f at xo, written 


f(x) ~ F(x0) + Jf (xo) (x — xo) 


on a neighbourhood of xo; in other words it approximates f by means of a degree- 
one polynomial in x (the Taylor polynomial of order 1 at ao). 

Propositions 5.7 and 5.8 carry over, as one sees by taking one component at a 
time. 


Also vectorial functions can be Lipschitz. The next statements generalise Defin- 
ition 5.14 and Proposition 5.16. Let R be a region inside dom f. 


Definition 6.3 The map f is Lipschitz on R if there is a constant L > 0 
such that 


|F(x1) — F(x2)|| < Llle1— al], Vai, a2 eR, (6.2) 


The smallest such L is the Lipschitz constant of f on R. 


Clearly f is Lipschitz on R if and only if all its components are. 


Proposition 6.4 Let R be a connected region in dom f. Suppose f is dif- 
ferentiable on such region and assume there is an M > 0 such that 


Ofi 


Oj 


(@)| <M, Vee Rit — ans la a 


Then f is Lipschitz on R with L=./nmM. 
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The proof is an easy consequence of Proposition 5.16 applied to each component 
of f. 

Vector-valued functions do not have an analogue of the Mean Value The- 
orem 5.12, since there might not be an % € S[a,b] such that f(b) — f(a) = 
Jf (%)(b — a) (whereas for each component f; there clearly is a point %; € S|a, bj 
satisfying (5.11)). A simple counterexample is the curve f : R > R?, f(t) = (t?,t3) 
with a = 0, b= 1, for which f(1) — f(0) = (1,1) but Jf(t) = (2t, 3¢7). We cannot 
find any # such that Jf(£)(b — a) = (24,34) = (1,1). 

Nevertheless, one could prove, under hypotheses similar to Lagrange’s state- 
ment 5.12, that 


|F(6) — fla)|| < sup ||Jf(x)|| ]b— all, 


xeESla,b] 
where the Jacobian’s norm is defined as in Sect. 4.2. 


At last, we extend the notion of functions of class C”, see Sect. 5.4. A map f is 
of class Ck (0<k< oo) on the open set 92 C dom f if all components are of class 
C* on 2; we shall write f € (C*(2))". A similar definition is valid if we take 2 
instead of 92. 


6.3 Basic differential operators 
Given a real function y, defined on an open set {2 in R” and differentiable on 2, 
we saw in Sect. 5.2 how to associate to such a scalar field on 2 the (first) partial 


O 
derivatives a with respect to the coordinates x;, 7 = 1,...,n; these are still 
Lj 


O 
scalar fields on (2. Each mapping y > — is a linear operator, because 
Lj 


a ap. ow 
De pa Or ee =A5n tH oe 


for any pair of functions y, w differentiable on 2 and any pair of numbers A, p € R. 

The operator maps C!(Q) to C°(Q): each partial derivative of a C! function on Q 

is of class C° on Q; in general, each operator Fp, maps C"( 2) te CP (2), for any 
x . 


j 
ke 1. 


6.3.1 First-order operators 


Using operators involving (first) partial derivatives we can introduce a host of 
linear differential operators of order one that act on (scalar or vector) fields defined 
and differentiable on 2, and return (scalar or vector) fields on 2. The first we wish 
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to describe is the gradient operator; as we know, it associates to a differentiable 


scalar field the vector field of first derivatives: 


0 0 6) 
grady = Ve = (5%) =F et te en. 
UG) sje OF Ln 


Thus if » € C!(Q), then grady € (C°(Q))", meaning the gradient is a linear 
operator from C1(2) to (C°(Q))". 


Let us see two other fundamental linear differential operators. 


Definition 6.5 The divergence of a vector field f = fie, +---+ fren, 
differentiable on 92 C R”, is the scalar field 


foe em Ofn <a OF; 
div f = ae specs ae 2 oe (6.3) 


The divergence operator maps (C1(2))" to C°(Q). 


Definition 6.6 The curl of a vector field f = fit+ fog + fak, differentiable 
on 2 CR, is the vector field 


oul f = (52-52) i4 (S458) 54 (SR OA) 


Ox2 0x3 0x3 0x1 Ox1 Ox2 


j k 
ican 
Ox2 0x3 


fo f3 


(the determinant is computed along the first row). The curl operator maps 


(Oye to (c%2))°. Another symbol used in many European countries is 
rot, standing for rotor. 


We remark that the curl, as above defined, acts only on three-dimensional vector 
fields. In dimension 2, one defines the curl of a vector field f, differentiable on an 
open set 2 C R?, as the scalar field 


Cap, GM 
| geen chee a 
curl f Bis, ls (6.5) 
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Note that this is the only non-zero component (the third one) of the curl of 
® (x1, 02,73) = fi(v1, ©2)t + fo(a1,2%2)j + Ok, associated to f; in other words 


curl® = 01+ 07 + (curl f)k. (6.6) 


Sometimes, in dimension 2, the curl of a differentiable scalar field y on an open 
set QC R? is also defined as the (two-dimensional) vector field 


Here, too, the definition is suggested by a suitable three-dimensional curl: setting 
® (21, 12,13) = 0 + 07 + Y(21, 22)k, we see immediately 


curl@ = curly + Ok. 


Higher-dimensional curl operators exist, but go beyond the purposes of this book. 


Occasionally one finds useful to have a single formalism to represent the three 
operators gradient, divergence and curl. To this end, denote by V the symbolic 


vector whose components are the partial differential operators me. 


v= (=) = 
O25] y<j<n 


L41 OLn, 


In this way the gradient of a scalar field y, denoted Vy, may be thought of as 
obtained from the multiplication (on the right) of the vector V by the scalar y. 
Similarly, (6.3) shows the divergence of a vector field f is the dot product of the 
two vectors V and f, whence one writes 


In dimension 3 at last, the curl of a vector field f can be obtained, as (6.4) suggests, 
computing the cross product of the vectors V and f, allowing one to write 


Let us illustrate the geometric meaning of the divergence and the curl of a 
three-dimensional vector field, and show that the former is related to the change 
in volume of a portion of mass moving under the effect of the vector field, while 
the latter has to do with the rotation of a solid around a point. So let f : R? — R® 
be aC! vector field, which we shall assume to have bounded first derivatives on the 
whole R?. For any x € R? let ®(t, x) be the trajectory of f passing through & at 
time t = 0 or, equivalently, the solution of the Cauchy problem for the autonomous 
differential system 
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= f(®), t>0, 
b(0, 2) = 2. 


We may also suppose that the solution exists at any time t > 0 and is differentiable 
with continuity with respect to both t and & (that this is true will be proved in 
Ch. 10). Fix a bounded open set 29 and follow its evolution in time by looking at 
its images under ® 


% = €8,20) ={z= B(x): 2 € No}. 


We will introduce in Chapter 8 the triple integral of a map g defined on (2, 
| g(x,y, 2) da dy dz, 
2 


and show that, when g is the constant function 1, the integral represents the 
volume of the set §2;. Well, one can prove that 


— | dedydz = [ div f dxdydz, 
dt on Qa 


which shows it is precisely the divergence of f that governs the volume variation 
along the field’s trajectories. In particular, if f is such that div f = 0 on R?, the 
volume of the image of any open set (29 is constant with time (see Fig. 6.1 for a 
picture in dimension two). 


Y 


2 
av 

AY ea 
a 

iY ee 
x 


Figure 6.1. The area of a surface evolving in time under the effect of a two-dimensional 
field with zero divergence does not change 


208 6 Differential calculus for vector-valued functions 


Let now f(x) = Az be a particular rigid motion of a three-dimensional solid 
S, namely a (clockwise) rotation around the z-axis by an angle 6. Then A is 
orthogonal (viz. it does not affect distances); precisely, it takes the form 
cos@ sind O 
A= | -sin@ cosé 0 
0 0 1 
It is easy to see that 
curl f = 072+ 07 — 2sin0k; 


so the curl of f has only one non-zero component, in the direction of rotation, 
that depends on the angle. 


After this digression we return to the general properties of the operators gradi- 
ent, divergence and curl, we observe that also the latter two are linear, like the 
gradient 


div (Af + ug) = Adiv f + udivg, 


curl (Af + wg) = Acurl f + wcurlg 


for any pair of fields f, g and scalars A, w. Moreover, we have a list of properties 
expressing how the operators interact with various products between (scalar and 
vector) fields of class C?: 


grad (pw) = ygrady + pgrady, 
grad (f-g)=gJf +f Jg, 
div (yf) =grady- f + pdivf, 


div(f \g) =g-curlf —f-curlg, 


curl(yf) = gradyA f+vycurl f, 
curl(f \g)=fdivg—gdivf+gJf—fJg. 


Their proof is straightforward from the definitions and the rule for differentiating 
a product. 

In two special cases, the successive action of two of grad, div, curl on a 
sufficiently regular field gives the null vector field. The following results ensue 
from the definitions by using Schwarz’s Theorem 5.17. 
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Proposition 6.7 i) Let y be a scalar field of class C? on an open set Q of 
R°. Then 


curl grad yy = VA (Vy) =0 OMe 


it) Let ® be a C? vector field on an open set Q of R°. Then 


diveurl@ =V-(VA®)=0 nee 


The two-dimensional version of the above results reads 


Proposition 6.8 Let y be a scalar field of class C? on an open set 2 in R?. 
Tey 
curl grad y = 0 and div curly = 0 on 22. 


These propositions steer us to the examination of fields with null gradient, 
null curl or zero divergence on a set §2, a study with relevant applications. It is 
known (Proposition 5.13) that a scalar field y has null gradient on 9 if and only 
if y is constant on connected components of (2. For the other two operators we 
preliminarly need some terminology. 


Definition 6.9 i) A vector field f, differentiable on an open set Q in R? 
and such that curl f = 0, is said irrotational (or curl-free) on 92. 

ii) A vector field f, differentiable on an open set 2 of R” and such that 
div f = 0 is said divergence-free on 12. 


Definition 6.10 i) A vector field f on an open set 2 of R" is conservative 
in 92 if there exists a scalar field p such that f = grady on (2. The 
function ¢ is called a (scalar) potential of f. 

it) A vector field f on an open set Q of R® is of curl type if there exists 
a vector field ® such that f = curl® on 92. The function ® is called a 
(vector) potential for f. 


Taking these into consideration, Proposition 6.7 modifies as follows: i) if a C? 
vector field f on an open set Q of R® is conservative (and so admits a scalar 
potential of class C?), then it is necessarily irrotational. ii) If aC vector field f on 
an open set 2 of R? admits a C? vector potential, it is divergence-free. Equivalently, 
we may concisely say that: i) f conservative implies f irrotational; ii) f of curl 
type implies f divergence-free. 

The natural question is whether the above necessary conditions are also suf- 
ficient to guarantee the existence of a (scalar or vectorial) potential for f. The 
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answer will be given in Sect. 9.6, at least for fields with no curl. But we can say 
that in the absence of additional assumptions on the open set 2 the answer is neg- 
ative. In fact, open subsets of R? exist, on which there are irrotational vector fields 
of class C! that are not conservative. Notwithstanding these counterexamples, we 
will provide conditions on 9 turning the necessary condition into an equivalence. 
In particular, each C! vector field on an open, convex subset of R® (like the in- 
terior of a cube, of a sphere, of an ellipsoid) is irrotational if and only if it is 
conservative. 

Comparable results hold for divergence-free fields, in relationship to the exist- 
ence of vector potentials. 


Examples 6.11 
i) Let f be the affine vector field 
7:2? =P’, f(x) = Ax+b 
defined by the 3 x 3 matrix A and the vector b of R®. Immediately we have 


div f = a11 + a22 + G33, 


curl f = (a32 — a23)t + (a31 — a13)J + (G21 — aia)k. 
Therefore f has no divergence on R® if and only if the trace of A, trA = 
a1 + @22 + a33, is zero. The field is, instead, irrotational on R? if and only if A 
is symmetric. 
Since R? is clearly convex, to say f has no curl is the same as asserting f is 
conservative. In fact if A is symmetric, a (scalar) potential for f is 


ole) = 50 Aw—b-a. 


Similarly, f is divergence-free if and only if it is of curl type, for if tr A is zero, 
a vector potential for f is 


1 1 
®(x) = 3 (Aa) Na+ go Ae. 
Note at last the field 


fle) =(y+2)t+ w—2z)5+ (e@-y)k, 
corresponding to 


0 1 1 
A=|1 0 -1] and b=0, 
1 -1 0 


is an example of a simultaneously irrotational and divergence-free field. 


ii) Given f € (c1(2))°, and ao € §2, consider the Jacobian Jf(xo) of f at xo. 
Then (div f)(xo) = 0 if and only if tr (Jf(ao)) = 0, while (curl f) (xo) = 0 if 
and only if Jf(ao) is symmetric. In particular, the curl of f measures the failure 
of the Jacobian to be symmetric. 
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Finally, we state without proof a theorem that casts some light on Defini- 
tions 6.5 and 6.6. 


Theorem 6.12 Let 2 be an open convex subset of R°. Every C! vector field 
on §2 decomposes (not uniquely) into the sum of an irrotational field and a 


field with no divergence. In other terms, for any f € (c1(2))° there exist 
FOO € Coy with curl f”) = 0 on Q, and ftfre?) EoOy with 


div f(4free) — 0 on Q, such that 
= eo ze yo ; 


Such a representation is known as Helmholtz decomposition of f. 


Example 6.13 


We return to the affine vector field on R* (Example 6.11 i)). Let us decompose 
the matrix A in the sum of its symmetric part AY” = 4(A+ A”) and skew- 
symmetric part A(skew) — (A — A’), 
A= Alsy™) ue Alskew) ; 

Setting f(x) = ACYMa+band f(trslred (x) = Alskev) x realises the Helm- 
holtz decomposition of f: f"” is irrotational as AY” is symmetric, f(s") 
is divergence-free as the diagonal of a skew-symmetric matrix is zero. Adding to 
A9™) an arbitrary traceless diagonal matrix D and subtracting the same from 
Alskew) gives new fields f"”) and f(“%’fre®) for a different Helmholtz decom- 
position of f. 


6.3.2 Second-order operators 


The consecutive action of two linear, first-order differential operators typically pro- 
duces a linear differential operator of order two, obviously defined on a sufficiently- 
regular (scalar or vector) field. We have already remarked (Proposition 6.7) how 
letting the curl act on the gradient, or computing the divergence of the curl, of 
a C? field produces the null operator. We list a few second-order operators with 
crucial applications. 


i) The operator div grad maps a scalar field y € C?() to the C°(Q) scalar field 


div grady = V-Vy= De ane (6.8) 


sum of the second partial derivatives of y of pure type. The operator div grad is 
known as Laplace operator, or simply Laplacian, and often denoted by A. 
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Therefore 


and (6.8) reads Ay. Another possibility to denote Laplace’s operator is V7, which 
intuitively reminds the second equation of (6.8). Note V?y cannot be confused 
with V(Vy), a meaningless expression as V acts on scalars, not on vectors; V7 
is purely meant as a shorthand symbol for V - (Vy). 

A map ¢ such that Ay = 0 on an open set 2 is called harmonic on (2. Harmonic 
maps enjoy key mathematical properties and intervene in the description of several 
physical phenomena. For example, the electrostatic potential generated in vacuum 
by an electric charge at the point x9 € R® is a harmonic function defined on the 
open set R® \ {ao}. 

ii) The Laplace operator acts (component-wise) on vector fields as well, and one 


sets 
Af = Afie: +---+ Afnen 


where f = fie1+--- fn€n. Thus the vector Laplacian maps (C?(2))" to (C°(Q))". 


iii) The operator grad div transforms vector fields into vector fields, and precisely 
it maps (C?(2))” to (C°(Q))”. 


iv) Similarly the operator curlcurl goes from (c2(2))° to (c9(2))°. 
The latter three operators are related by the formula 


Af — grad div f + curlcurl f =0. 


6.4 Differentiating composite functions 


Let 


f :dom f C R” > R” and g:domg C R” > R? 


be two maps and ao € dom f a point such that yo = f(ao) € domg, so that the 
composite 


h=gof:domhCR” > R’, 
for x9 € dom h, is well defined. We know the composition of continuous functions 
is continuous (Proposition 4.23). 
As far as differentiability is concerned, we have a result whose proof is similar 
to the casen = m= p= 1. 


Theorem 6.14 Let f be differentiable at x) € domh and g differentiable at 
yo = f(x0). Thenh=gof is differentiable at xo and its Jacobian matrix is 


J(g 0 f)(xo) = Jg(yo) Jf (xo) (6.9) 
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Note Jf(xo) is m x n, Jg(yo) is p x m, hence the product of matrices on the 
right-hand side is well defined and produces a p x n matrix. 

We shall make (6.9) explicit by writing the derivatives of the components of h 
in terms of the derivatives of f and g. For this, set # = (x;)1<j<n and 


y = f(x) = (fe(®))epem> 2 =9(¥) = (GY) eiep: 
2=Tie) = (hi(2)) icp: 
The entry on row i and column j of Jh(ao) is 


Oh; 
On; 


(a0) = 5 > 2% (ap) 24 (wp). (6.10) 


To remember these equations, we can write them as 


(6.11) 


also known as the chain rule for differentiating composite maps. 


Examples 6.15 


i) Let f = (fi, fo) : R? — R? and g: R? > R be differentiable. Call h = go f : 
R? + R the composition, h(x, y) = g(fi(a, y), fo(z,y)). Then 


Vh(ar) = Vo(F(a)) Jf (a): (6.12) 


putting u = fi(z,y), v = fo(x, y), this becomes 
Oh Og Of Og 


Bn y= Dy ere) Ba y) + Dy ves v) Bq (bY) 
Oh Og Of g Of 
FA) ( ’ ae oy. yt a i) By | ’ ) 


ii) Let py: I C R > R be a differentiable map and f : R? > R a scalar 
differentiable function. The composite h(x) = f(x,y (z)) is differentiable on J, 
and 


as follows from Theorem 6.14, since h = fo® with ® : I + R’, ®(x) = (a,y(z)). 


When a function depends on one variable only, the partial derivative symbol 
in (6.9) should be replaced by the more precise ordinary derivative. 
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iii) Consider a curve y : J C R > R? of differentiable components 7;, together 
with a differentiable map f : R° > R. Let h = foy: I > R be the composition. 
Then 


h'(t) = VF (y(t) -¥'(), (6.13) 
or, putting (x, y, z) = (4), 


Theorem 6.14 can be successfully applied to extend the one-variable res- 
ult, known to the student, about the derivative of the inverse function (Vol. I, 
Thm. 6.9). We show that under suitable hypotheses the Jacobian of the inverse 
function is roughly speaking the inverse Jacobian matrix. Sect. 7.1.1 will give us 
a sufficient condition for the following corollary to hold. 


Corollary 6.16 Let f : dom f C R” > R” be differentiable at xo with non- 
singular Jf(ao). Assume further that f is invertible on a neighbourhood of 
Xo, and that the inverse map f—' is differentiable at yo = f (ao). Then 


I(f-)(yo) = (JF (ao) - 


Proof. Applying the theorem with g = f~! will meet our needs, because 
h=f-'of is the identity map (h(x) = 2 for any x around 29), hence 
Jh(x0) = I. Therefore 


J(f~*)(yo) Jf (ao) =I, 


whence the claim, as Jf (ao) is invertible. O 


6.4.1 Functions defined by integrals 


We encounter a novel way to define a function, a way that takes a given map of 
two (scalar or vectorial) variables and integrates it with respect to one of them. 
This kind of map has many manifestations, for instance in the description of elec- 
tromagnetic fields. In the sequel we will restrict to the case of two scalar variables, 
although the more general treatise is not, conceptually, that more difficult. 

Let then g be a real map defined on the set R = I x J C R’, where J is an 
arbitrary real interval and J = [a, }| is closed and bounded. Suppose g is continuous 
on R and define 


(6.14) 
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this is a one-variable map defined on J, because for any x € I the function y > 
g(x, y) is continuous hence integrable on J. 

Our function f satisfies the following properties, whose proof can be found in 
Appendix A.1.3, p. 516. 


Proposition 6.17 The function f of (6.14) is continuous on I. Moreover, if 


g admits continuous partial derivative we on R, then f is of class C! on I 


Ox 


5 
Og 
a 6h y) dy. 


and 


This proposition spells out a rule for differentiating integrals: differentiating in 
one variable and integrating in the other are commuting operations, namely 


d f? oo) 
— | a(a,y)ay = | ae (@,y) dy. 


The above formula extends to higher derivatives: 


b Ok 
O 
M@)= f sets) ay, hel. 


provided the integrand exists and is continuous on FR. 


A more general form of (6.14) is 


(6.15) 


where a and £ are defined on J with values in |a, 0). 
Notice that the integral function 


f(a) = f oe 


considered in Vol. I, Sect. 9.8, is a special case. 
Proposition 6.17 generalises in the following manner. 


Proposition 6.18 If a and 6 are continuous on I, the map f defined 
by (6.15) ts continuous on I. If moreover g admits continuous partial de- 


O 
rivative a on R anda, B are C! on I, then f isC! on I, and 


B(z) 
f(x) = ii OF (x, y) dy + 8'(x)g(x,8 (@)) —a"(a)g(w,o(a)). (6.16) 
a(x) Ox 
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Proof. We shall only prove the formula, referring to Appendix A.1.3, p. 517, for 
the rest of the proof. 
Define on I x J? the map 


qd 
F(z, p,q) = g(x, y) dy ; 
Pp 


it admits continuous first derivatives everywhere on its domain 


OF % Og 

Dy oP q) = “ PAGED dy , 

OF OF 

Bp (Ps 4) = —9 FP)» Bq PD) = 9D 


The last two descend from the Fundamental Theorem of Integral Calculus. 
The assertion now follows from the fact that f(x) = F(a, (x), B(x)) by 
applying the chain rule 


Example 6.19 
The map 


is of the form (6.15) if we set g(x,y) = = a(x) = 2, B(x) = 2. As g is 
not integrable in elementary functions with respect to y, we cannot compute 
f(x) explicitly. But g is C! on any closed region contained in the first quadrant 
x >0,y > 0, while a and £ are regular everywhere. Invoking Proposition 6.18 
we deduce f is C* on (0,+00), with derivative 


2 


f'(@)=- [ ye 2Y dy + g(a, x”) 2x2 — g(a, x) 


ety y=07 2s 61 ts BB 
po oe 
Using this we can deduce the behaviour of f around the point x9 = 1. Firstly, 
f(1) = Oand f’(1) =e}; secondly, differentiating f’(a) once more gives f’”(1) = 
—7Te~!. Therefore 


f(a) = <(e— 1) ~ £1)? + o((w-1)), ei, 


implying f is positive, increasing and concave around Zo. 
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Pat = y(to + At) 


Figure 6.2. Tangent and secant vectors to a curve at Po 


6.5 Regular curves 


Curves were introduced in Sect. 4.6. A curve y : J] > R™ is said differentiable 
if its components x; : J > R,i <i < m, are differentiable on J (recall a map 
is differentiable on an interval J if differentiable at all interior points of J, and 
at the end-points of J where present). We denote by yy’ : J > R™ the derivative 


y(t) = (234))  cicm = ae a; (t)e. 


Definition 6.20 A curve 7: / — R”™ is regular if it is differentiable on I 
with continuous derivative (the components are C' on I) and if y'(t) 4 0, for 
anyt el. 


A curve y: I > R” is piecewise regular if I is the finite union of intervals 
on which vy is regular. 


If y is a regular curve and tg € J, we can interpret the vector y'(to) geomet- 
rically (Fig. 6.2). Calling Po = (to) and taking to + At € J such that the point 
Pat = y(to + At) is distinct from Po, we consider the lines through Pp and Pat; 
by (4.26), a secant line can be parametrised as 


S(t) = Po + (Pat - Po) = (to) + ato FA to) —to). (6.17) 


Letting now At tend to 0, the point Pa; moves towards Pp (in the sense that each 
component of Pa; tends to the corresponding component of P5). Meanwhile, the 
_ ¥(to + At) — ¥(to) 


vector o = a(to, At) = = tends to y'(to) by regularity. 
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The limit position of (6.17) is thus 


TAG) = sin aR) (ene =) teR, 


the tangent line to the trace of the curve at Po. For this reason we introduce 


Definition 6.21 Let y: I — R”™ be a regular curve, and to € I. The vector 


+'(to) is called tangent vector to the trace of the curve at Po = 7¥(to). 


To be truly rigorous, the tangent vector at Pp is the position vector (Po, 7’ (to)), 
but it is common practice to denote it by y'(to). Later on we shall see the tangent 
line at a point is an intrinsic object, and does not depend upon the chosen para- 
metrisation; its length and orientation, instead, do depend on the parametrisation. 

In kinematics, a curve in R? models the trajectory of a point-particle occupying 
the position y(t) at time t. When the curve is regular, -y’(t) is the particle’s velocity 
at the instant tf. 


Examples 6.22 
i) All curves considered in Examples 4.34 are regular. 


ii) Let y: J > R denote a continuously-differentiable map on J; the curve 
yH=(te@), tel, 
is regular, and has the graph of y as trace. In fact, 
¥(t) = (1, ¢'() 4 (0,0), for anyt€ I. 
iii) The are +: [0,2] + R? defined by 
oe { Cie. be 10,1), 


(t,t), ¢€ [1,2], 
parametrises the polygonal path ABC (see Fig. 6.3, left); the arc 
(Eli te [0,1), 
y(t) =< (60). t € [1,2), 


(4—t,2-4(t-2)), te [2,4], 
parametrises the path ABCA (Fig. 6.3, right). Both curves are piecewise regular. 


iv) The curves 
y(t) = (1+ v2cost,V2sint), — t € [0,27], 
y(t) = (1 + /2cos2t, —V2sin 2t) : t € [0,7], 


are different parametrisations (counter-clockwise and clockwise, respectively) of 
the same circle C, whose centre is (1,0) and radius V2. 
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A A 
C C 
1 i 
A |B | A 'B | 
| | = a 
O 1 2 O 1 2 


Figure 6.3. The polygonal paths ABC (left) and ABCA (right) of Example 6.22 iii) 


They are regular with derivatives 


At) = V2( —sint, cost) , A(t) = 2/2( — sin 2t, — cos 2t) . 
The point Po = (0,1) € C is the image under +¥ of to = Snr, and of to = air under 
4: Po = y(to) = ¥(to). In the first case the tangent vector is -y'(to) = (—1,—1) 
and the tangent line at Po 
T(t) = (0,1) -(1, 1)(¢- r) - (-t+$n1-t+ Sn), teR; 
in the second case ¥'(to) = (2,2) and 
T(t) = (0,1) + (2,2)(t — r) = (2(t - =m) 1+20¢= >) , ¢teR. 


The tangent vectors at Po have different length and orientation, but the same 
direction. Recalling Example 4.34 i), in both cases y = 1+ is the tangent line. 


v) The curve 7, : R — [0,+00) x R?, defined in spherical coordinates by 
T I; 
v(t) = (rb), e),00)) = (U1, mil at 5 Sin 8t), t) , 
describes the periodic motion, on the unit sphere, of the point P(t) that re- 
volves around the z-axis and simultaneously oscillates between the parallels of 
colatitude Ymin = 7 and Ymax = oa (Fig. 6.4). The curve is regular, for 


y,(t) = (0, 27 cos 8t, 1). q 
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Figure 6.4. A point moving on a sphere (Example 6.22 v)) 


6.5.1 Congruence of curves; orientation 


Now we discuss some useful relationships between curves parametrising the same 
trace. 


Definition 6.23 Let y: I — R™ and 6: J > R™ be regular curves. They 
are called congruent if there is a bijection py : J > I, differentiable with 
non-zero continuous derivative, such that 


Oo 


(hence 6(7) = y(v(r)) for any t € J). 


For the sequel we remark that congruent curves have the same trace, for x = 
6(r) if and only if a = y(t) with t = y(r). In addition, the tangent vectors at the 
point Py = +(to) = 6(70) are collinear, because differentiating 6(7) = y(y(r)) at 
To gives 

8'(r0) =! (70) e"(70) = ¥' (to)e"(r0) (6.18) 


with y’(7m) 4 0. Consequently, the tangent line at Pp is the same for the two 
curves. 

The map y, relating the congruent curves -y, 6 as in Definition 6.23, has y’ 
always > 0 or always < 0 on J; in fact, by assumption y’ is continuous and never 
zero on J, hence has constant sign by the Theorem of Existence of Zeroes. This 
entails we can divide congruent curves in two classes. 


Definition 6.24 The congruent curves y: I > R™, 6: J > R™ are equi- 


valent if the bijection py: J > I has strictly positive derivative, while they 
are anti-equivalent if iy! is strictly negative. 


Here is an example of two anti-equivalent curves. 
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Definition 6.25 Let y : I > R” be a regular curve. Denoting by —I the 
interval {t ER: —t € I}, the curve —y: —I > R™, (—7+)(t) = y(t) is said 


opposite to y, and is anti-equivalent to +. 


We may write (-y) = yoy, where y : —I — I is the bijection y(t) = —t. If 
y : [a,b] — R™ is a regular arc then —7 is still a regular arc defined on the 
interval [—b, —a]. 

If we take anti-equivalent curves -y and 6 we may also write 


8(r) = ¥(¥(7)) = 9 - (-¥(7))) = (-9) (H) 


where w : J > —I, w(7) = —y(r). Since ~’(7) = —y'(T) > 0, the curves 6 and 
(—) will be equivalent. In conclusion, 


Property 6.26 Congruent curves are either equivalent or one is equivalent 


to the opposite of the other. 


Due to this observation we shall adopt the notation 6 ~ vy to denote two 
equivalent curves, and 6 ~ —+ for anti-equivalent ones. 

By (6.18) now, equivalent curves have tangent vectors pointing in the same 
direction, whereas anti-equivalent curves have opposite tangent vectors. 


Assume from now on that curves are simple. It is immediate to verify that all 
curves congruent to a simple curve are themselves simple. Moreover, one can prove 
the following property. 


Proposition 6.27 If I’ denotes the trace of a simple, regular curve y, any 


other simple regular curve 6 having I’ as trace is congruent to 7+. 


Thus all parametrisations of I’ by simple regular curves are grouped into two 
classes: two curves belong to the same class if they are equivalent, and live in 
different classes if anti-equivalent. To each class we associate an orientation of 
I’. Given any parametrisation of I in fact, we say the point P, = y(tz2) follows 
P, = y(t1) on ¥ if te > ty (Fig. 6.5). Well, it is easy to see that if 6 is equivalent 
to y, P2 follows P; also in this parametrisation, in other words P) = 6(72) and 
P, = 6(71) with 72 > 7. Conversely, if 6 is anti-equivalent to y, then P2 = 6(tT2) 
and P; = 6(71) with tT < ™, so P; will follow Pp. 

As observed earlier, the orientation of [’ can be also determined from the 
orientation of its tangent vectors. 

The above discussion explains why simple regular curves are commonly thought 
of as geometrical objects, rather than as parametrisations, and often the two no- 
tions are confused, on purpose. This motivates the following definition. 
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P2 


Figure 6.5. An arc I’ in R® and an orientation on it 


Definition 6.28 A subset I’ of R™ is called a simple regular curve if it 


can be described as the trace of a curve with the same properties. 


If necessary one associates to I’ one of the two possible orientations. The choice 
of I’ and of an orientation on it determine a class of simple regular curves all 
equivalent to each other; within this class, one might select the most suitable 
parametrisation for the specific needs. 

Every definition and property stated for regular curves adapts easily to 
piecewise-regular curves. 


6.5.2 Length and arc length 


We define the length of the regular arc y : [a,b] — R™ as the number 


(6.19) 


The reason is geometrical (see Fig. 6.6). We subdivide [a,b] using a = to < ti < 
...,tK-1 <tx = 0 and consider the points P, = y(t.) € , k = 0,...,K. They 
determine a polygonal path in R™ (possibly degenerate) whose length is 


K 


£(éo, 41, ee ,tK) = S- dist (Peaigks)s 
k=1 


where dist (Px—1, Px) = || Px — Pr—i|| is the Euclidean distance of two points. Note 


m 


S~ (wi(te) — vi(te1))° 


1=1 


I| Pk — Pr—al| = 
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Px = y(tx) 


Figure 6.6. Approximation of the trace of an arc by a polygonal path 


where At; = t, —t,—1 and 


(S)) - (eegent). 


Therefore 


notice the analogy with the last integral in (6.19), of which the above is an approx- 
imation. One could prove that if the curve is piecewise regular, the least upper 
bound of ¢(to,t1,...,t«) over all possible partitions of [a,b] is finite, and equals 
f(y). 

The length (6.19) of an arc depends not only on the trace I’, but also on the 
chosen parametrisation. For example, parametrising the circle 2? + y? = r? by 
“i(t) = (rcost,rsint), t € [0,27], we have 


27 
an) = f rdt =2nrr, 


as is well known from elementary geometry. But if we take -y2(t) = (r cos 2t, r sin 2), 
t € [0, 27], then 


27 
ta) = i 2rdt = 4ar. 
0 


In the latter case we went around the origin twice. Recalling (6.18), it is easy to 
see that two congruent arcs have the same length (see also Proposition 9.3 below, 
where f = 1). We shall prove in Sect. 9.1 that the length of a simple (or Jordan) 
arc depends only upon the trace I’; it is called length of I’, and denoted by ¢(I’). 
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In the previous example 7y; is simple while ~y2 is not; the length @(I°) of the circle 
is e(m1). 


Let yy be a regular curve defined on the interval J, on which we fix an arbitrary 
point to € J, and introduce the function s:J => R 


(6.20) 


Recalling expression (6.19) for the length of an arc, we have 


L(y [to,t]) ’ t> to ’ 
s(t) = 0 5 t= to 5 
—L(y ran , b<to. 


The function s allows to define an equivalent curve that gives a new paramet- 
risation of the trace of y. By the Fundamental Theorem of Integral Calculus, in 
fact, 

s(th=|lY¥@||>0, Weel, 


so s is a strictly increasing map, and hence invertible on J. Call J = s(J) the 
image interval under s, and let t : J —~ I C R be the inverse map of s. In 
other terms we write ¢ as a function of another parameter s, t = t(s). The curve 
y:J—>R"™, ¥(s) = y(t(s)) is equivalent to y (in particular it has the same trace 
I). If P, = y(t1) is an arbitrary point on I’, then P; = ¥(s1) with t; and s, 
related by t; = t(s;). The number s, is the arc length of P,. 

Recalling the rule for differentiating an inverse function, 


4) = De = Zw) Lo= Fe. 


whence 
IW @ll=1, VseJ. (6.21) 


This means that the arc length parametrises the curve with constant “speed” 1. 
The definitions of length of an arc and arc length extend to piecewise-regular 

curves. 

Example 6.29 


The curve y : R > R°, y(t) = (cost,sint,t) has trace the circular helix (see 
Example 4.34 vi)). Then 


II’ (t)|| = ||(— sint, cost, 1)|| = (sin? t + cos?¢ + 1)/? = V2. 
Choosing to = 0, 


s(t) =| IIa (7)|| dr = vi f dr = V2t. 
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Therefore t = t(s) = V2 g, s € R, and the helix can be parametrised anew by arc 
length 


~ J2 . V2.2 
+(s)= ([ > 8 sin 5; 2.) ; Oo 


2 2 


6.5.3 Elements of differential geometry for curves 


This section is dedicated to the intrinsic geometry of curves in R®, and is not 
strictly essential for the sequel. As such it may be skipped at first reading. 

We consider a regular, simple curve I’ in R? parametrised by arc length s 
(defined from an origin point P*). Call ~ = y(s) such parametrisation, defined on 
an interval J C R, and suppose 7¥ is of class C? on J. 

If t(s) = y'(s) is the tangent vector to I’ at P = y(s), by (6.21) we have 


lt(s)|=1, VseJ, 


making t(s) a unit vector. Differentiating once more in s gives t/(s) = y'"(s), a 
vector orthogonal to t(s); in fact differentiating 


3 
le(s)|? =SoH(s)=1, Vee J, 
t=1 


we find 


ie., t(s) - t’(s) = 0. Recall now that if -+(s) is the trajectory of a point-particle 
in time, its velocity ¢(s) has constant speed = 1. Therefore t/(s) represents the 
acceleration, and depends exclusively on the change of direction of the velocity 
vector. Thus the acceleration is perpendicular to the direction of motion. 

If at Po = y(so) the vector t’(so) is not zero, we may define 


(6.22) 


called principal normal (vector) to I’ at Po. The orthonormal vectors t(sq) and 
n(so) lie on a plane passing through Po, the osculating plane to the curve I" at 
Po. Among all planes passing through Po, the osculating plane is the one that best 
adapts to the curve; to be precise, the distance of a point y(s) on the curve from 
the osculating plane is infinitesimal of order bigger than s — sg, as s > so. The 
osculating plane of a plane curve is, at each point, the plane containing the curve. 

The orientation of the principal normal has to do with the curve’s convexity 
(Fig. 6.7). If the curve is plane, in the frame system of t and n the curve can be 
represented around Po as the graph of a convex function. 
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a 


Figure 6.7. Osculating plane and tangent, normal, binormal vectors of I’ at Po 


The number K(s9) = ||t’(so)|| is the curvature of I’ at Po, and its inverse R(so) 
is called curvature radius. These names arise from the following considerations. 
For simplicity suppose the curve is plane, and let us advance by R(so) in the 
direction of n, starting from Po(so); the point C(so) thus reached is the centre 
of curvature (Fig. 6.8). The circle with centre C(so) and radius R(so) is tangent 
to I’ at Po, and among all tangent circles we are considering the one that best 
approximates the curve around Pp (osculating circle). 

The orthogonal vector to the osculating plane, 


b(so) = t(so) A n(so), (6.23) 


is known as the binormal (vector) to I at Po. This completes a positively-oriented 
triple (t, n, b) of orthonormal vectors, that defines a moving frame along a curve. 

The binormal unit vector, being orthogonal to the osculating plane, is constant 
along the curve if the latter is plane. Therefore, its variation measures how far the 
curve is from being plane. If the curve is C? it makes sense to consider the vector 
b’(so), the torsion vector of the curve at Po. Differentiating definition (6.23) 
gives 

b’(s0) _ t' (so) /\ n(so) + t(so) /\ n' (so) = t(so) /\ n' (so) 

as (so) is parallel to t’(so). This explains why n’(so) is orthogonal to t(so). At 
the same time, differentiating ||b(so)|/? = 1 gives b’(so) - b(so) = 0, making b’(so) 
orthogonal to b(sg) as well. Therefore b/(sg) is parallel to n(so), and there must 
exist a scalar T(so), called torsion of the curve, such that b’(so) = T(so)n(so). It 
turns out that a curve is plane if and only if the torsion vanishes identically. 

Differentiating the equation n(s) = b(s) A t(s) gives 


n'(s0) = B(so) A t'(so) + B’(s0) A t(50) 
= b(s0) A K(s0)n(so) + T(So)(S0) /\ t(so) 
= —K(so)t(so) — T(S9)b(so) - 
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Figure 6.8. Osculating circle, centre and radius of curvature at Po 


To summarise, the unit vectors t, n, b of aC® curve I satisfy the Frenet formulas 


(6.24) 


We conclude by pointing out that the vectors t,n,b admit a representation in 
terms of an arbitrary parametrisation of I’. 


6.6 Variable changes 


Let R be a region of R”. The generic point P € R is completely determined by its 
Cartesian coordinates 71, %2,..., £, which, as components of a vector x, allow to 
identify P = x. For i = 1,...,n the line 


Ry wt Rey = {ae = (015.2425 8 Ce + Pad, <2 5n) 2b S R} 
contains P and is parallel to the 7th canonical unit vector e;. Thus the lines R;, 


called coordinate lines through P, are mutually orthogonal. 


Definition 6.30 Let R’ be another region of R", with interior A’. A vector 
field ® : R’ + R defines a change of variables, or change of coordinates, 
on R if ® is: 


i) a bijective map between R’ and R; 

it) of class C1 on A’; 

iit) regular on A’, or equivalently, the Jacobian J® is non-singular everywhere 
on A’. 
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Figure 6.9. Change of variables in R 


Let A indicate the image ®(A’): then it is possible to prove that A is open, and 
consequently A C R; furthermore, ®(OR’) = ®(0A’) is contained in R \ A. 


Denote by Q = wu the generic point of R’, with Cartesian coordinates 
U1,-+-,Un. Given Py = a € A, part i) implies there exists a unique point 
Qo = uo € A’ such that Po = ®(Qo), or Lo = B(uo). Thus we may determ- 
ine Po, besides by its Cartesian coordinates 701,...,2Zon, also by the Cartesian 
coordinates Uo1,.--,Uon Of Qo; the latter are called curvilinear coordinates of 
Pp (relative to the transformation ®). The coordinate lines through Qo produce 
curves in R passing through Po. Precisely, the set 


D, = {a = ®(up + te;) : up + te; € R’ with t € I}, 


where 7 = 1,...,n and J; is an interval containing the origin, is the trace of the 
curve (simple, by i)) 
th Vile) = P(u0 + te;) 
defined on J;. These sets are the coordinate lines through the point Po (relative 
to ®); see Fig. 6.9. 
If Po € A these are regular curves (by iii)), so tangent vectors at Po exist 


(6.25) 


They are the columns of the Jacobian matrix of ® at uo. (Warning: the reader 
should pay attention not to confuse these with the row vectors Vy;(uo), which are 
the gradients of the components of ® = (—;)1<i<n.) Therefore by iii), the vectors 
T, 1 <i <n, are linearly independent (hence, non-zero), and so they form a basis 
of R”. We shall denote by t; = 7;/||7;|| the corresponding unit vectors. If at every 
Py € A the tangent vectors are orthogonal, i.e., if the matrix J®(uo)? J®(uo) is 
diagonal, the change of variables will be called orthogonal. If so, {t;,...,¢,} will 
be an orthonormal basis of R” relative to the point Pp. 
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Properties ii) and iii) have yet another important consequence. The map u +> 
det J®(w) is continuous on A’ because the determinant depends continuously upon 
the matrix’ entries; moreover, det J®(w) 4 0 for any u € A’. We therefore deduce 


det J®(u) >0, Vued’, or det J®(u) <0, VYueA’. — (6.26) 


In fact, if det J® were both positive and negative on A’, which is open and con- 
nected, then Theorem 4.30 would necessarily force the determinant to vanish at 
some point of A’, contradicting iii). 

Now we focus on low dimensions. In dimension 2, the first (second, respectively) 
of (6.26) says that for any uo € A’ the vector 7 can be aligned to 72 by a 
counter-clockwise (clockwise) rotation of 6 € (0, a]. Identifying in fact each 7; with 
7, = 7, + Ok € R®, we have 


Ti /\ To = ( det J®(uo))k, (6.27) 


orienting the triple (71, 72, k) positively (negatively). 

In dimension 3, the triple (71, 72, T3) is positively-oriented (negatively-oriented) 
if the first (second) of (6.26) holds; in fact, (7 A 72) - 73 = det J®(uo). 

Changes of variable on a region in R” are of two types, depending on the 
sign of the Jacobian determinant. If the sign is plus, the variable change is said 
orientation-preserving, if the sign is minus, orientation-reversing. 


Example 6.31 
The map 
z= @(u) = Au+b, 
with A a non-singular matrix of order n, defines an affine change of variables 
in R”. For example, the change in the plane (a# = (x,y), u = (u, v)) 


r= (vba) +2, y=e-u)4+4, 
is a translation of the origin to the point (2,1), followed by a counter-clockwise 
rotation of the axes by 7/4 (Fig. 6.10). The coordinate lines u = constant (v = 
constant) are parallel to the bisectrix of the first and third quadrant (resp. second 
and fourth). 


Figure 6.10. Affine change of variables in the plane 
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The new frame system is still orthogonal, confirmed by the matrix 
1 1 
_{( v2 v2 
v2 V2 
which is orthogonal since A? A = J. (In general, an affine change of variables 


is orthogonal if the associated matrix is orthogonal.) The change preserves the 
orientation, for det A=1> 0. 


6.6.1 Special frame systems 


We examine the changes of variables associated to relevant transformations of the 
plane and of space. 


i) Polar coordinates. Denote 
® : [0, +00) x R> R?, (r,0) + (x,y) = (rcos8,r sin 0) 


the map transforming polar coordinates into Cartesian ones (Fig. 6.11, left). It is 
differentiable, and its Jacobian 


cos@é —rsin@ 
(6.28) 
sin@d rcosé 
has determinant 
det J@®(r,0) =rcos?@+rsin?0=r. (6.29) 


Its positivity on (0,+00) x R makes the Jacobian invertible; in terms of r and 6, 
we have 


(6.30) 


sin@ cos@ 


cos 6 sin 6 


iP r 


Therefore, ® is a change of variables in the plane if we choose, for example, R = R? 
and R’ = (0,+00) x (—7,7] U {(0,0)}. The interior A’ = (0,+00) x (—7,77) has 
open image under ® given by A = R? \ {(z,0) € R? : x < O}, the plane minus 
the negative x-axis. The change is orthogonal, as J@? J&® = diag (1,r7), and 
preserves the orientation, because det J® > 0 on A’. 

Rays emanating from the origin (for 6 constant) and circles centred at the origin 
(r constant) are the coordinate lines. The tangent vectors 7;, 7 = 1,2, columns of 
J®, will henceforth be indicated by 7,., T and written as row vectors 


Tr = (cos6,sin 6), T) =1r(—sin6@,cos6) . 


Normalising the second one we obtain an orthonormal basis of R? 


(6.31) 
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4 Ya 


P= (z,y) 


Figure 6.11. Polar coordinates in the plane (left); coordinate lines and unit tangent 
vectors (right) 


formed by the unit tangent vectors to the coordinate lines, at each point P € 
R? \ {(0,0)} (Fig. 6.11, right). 


Take a scalar map f(x,y) defined on a subset of R? not containing the origin. 
In polar coordinates it will be 


f(r,0) = f(rcos6,rsin 8) , 


or f = fo®. Supposing f differentiable on its domain, the chain rule (Theorem 6.14 
or, more precisely, formula (6.12)) gives 


Vire)f = Viey J®, 


whose inverse is V f(z,y) = V er J®~—'. Dropping the symbol ~ for simplicity, 
those identities become, by (6.28), (6.30), 


Cid Of Op hi 


—=-rsind+ a cos 6 (6.32) 


cos? + —siné, pal aoe Dy 


Or Ox Oy 


and 


Oh ah) 


OF OF ing 4 OF 88 


(6.33) 


00 r 


Any vector field g(a, y) = gi (x, y)é + go(x,y)j on a subset of R? without the 
origin can be written, at each point (x,y) = (rcos@,rsin@) of the domain, using 
the basis {t,., to}: 


g = Ort, + goto (6.34) 


(here and henceforth the subscripts r, 8 do not denote partial derivatives, rather 
components). 
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As the basis is orthonormal, 


Gr = gt, = (git + gaj) -t, = g, cos? + go sind, 


(6.35) 
99 = 9 - to = (g1t + 929) + to = —gi sin# + go cos@. 
Of. Of. 
In particular, if g is the gradient of a differentiable function, grad f = Aan’! Ag Dy —}j, 
by (6.32) we have 
_ Of Of O of _ Of Of lof 
r— »j 6 — sin = — 0 os 9 = —-— ; 
de gg eh gee Bee ee ee oe 
therefore the gradient in polar coordinates reads 
O ile) 
grad f = a, + aati, (6.36) 


The divergence of a differentiable vector field g in polar coordinates is, similarly, 


Og. , Og2 _ OMi Ogi sin@ | O92 .. Og2 cos 6 
= = nee ap 
o> oe” op oe CO ae Ope Oe 
== (gi cos @ + go sin @)+ “leat gi sin @ + g2cos@)+ (91. cos 0 + gosin 8)| ; 


using (6.34) and (6.35), we find 


O>9r i 1 Ogo 
Be el 


divg = 


Combining this with (6.36) yields the polar representation of the Laplacian Af of 
a C? map on a plane region without the origin. Setting g = grad f in (6.37), 


O77. Loy ) 107 F 


Let us return to (6.34), and notice that — in contrast to the canonical unit 
vectors z, 7 — the unit vectors t,, tg vary from point to point; from (6.31), in 
particular, follow 


(6.39) 


To differentiate g with respect to r and @ then, we will use the usual product rule 
Og 09 Ogo 
ne ee Fe 
Or Or " - Or 


ag = (a — 90) + (Sp tt 


to, 
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ii) Cylindrical coordinates. Consider the C® transformation 


® : [0, +00) x R? > R*, (r,0,t) + (a, y, z) = (rcos6,rsin 8, t) 


describing the passage from cylindrical to Cartesian coordinates (Fig. 6.12, left). 
The Jacobian 


cosé —rsin@ 0 
J®(r,6,t)= | sind rcoséd 0 (6.40) 


0 0 1 


has determinant 

det J@®(r,0,t) =r, (6.41) 
strictly positive on (0, +00) x R?, so the matrix is invertible. Moreover, J®? J® = 
diag (1,77, 1). 

Thus ® is an orthogonal, orientation-preserving change of coordinates from 
R' = (0,+00) x (-7,7] x RU {(0,0,t) : t € R} to R = R*. The interior of 
R’ is A’ = (0,+00) x (—7,7) x R, whose image under ® is the open set A = 
R? \ {(x,0,z) € R?: x < 0, z € R}, the whole space minus half a plane. 

The coordinate lines are: horizontal rays emanating from the z-axis, horizontal 
circles centred along the z-axis, and vertical lines. The corresponding unit tangent 
vectors at P € R? \ {(0,0,z): z € R} are, with the obvious notations, 


t— (cos 0, sim.) tg = (—sin6,cos6,0), f= (0000 (6.42) 


They form an orthonormal frame at P (Fig. 6.12, right). 
A scalar map f(z, y, z), differentiable on a subset of R® not containing the axis 
z, 18 


77,0.) =f (¢ coed, 7 sind, 2); 


Zz 
A 
ti 
to 
Po 
t, 
k 
z 
P= (@,¥,0) : J y 


Figure 6.12. Cylindrical coordinates in space (left); coordinate lines and unit tangent 
vectors (right) 
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spelling out Vineet = Vf(a,y,z)J® gives 


—rsin? + OT eae) 
Oy 


_ Of sin® Ca 
OO or’ 7 


At last, this is how the basic differential operators look like in cylindrical coordin- 
ates 


O 10 
grad f = as pop nat te + Shh, 
oh 


10fo Of 


a eee 


- Of: afo Of, Of 
cut p= (222 ale ie. a) te 
a*f 10f 10°F Of 
a We ee ee 


tt 


div — 


Of 
( a r 00 


Ap = 


iii) Spherical coordinates. The function 


® : (0, +00) x R? > R®, (r,y,0) 6 (2, y,z) = (rsinycos6,rsinysin§,r cosy) 


is differentiable infinitely many times, and describes the passage from spherical 
coordinates to Cartesian coordinates (Fig. 6.13, left). As 


sinycos@ rcosycos@ —rsinysind 
J®&(r, 0,0) = sinysin@d rcosysin@ rsinwcosé (6.43) 


cos Y —rsing 0 


we compute the determinant by expanding along the last row; recalling sin? 6 + 
cos? @ = 1 and sin? y + cos? y = 1, we find 


det J@®(r,y,0) =r? sing, (6.44) 


strictly positive on (0, +00) x (0,7) x R. The Jacobian is invertible on that domain. 
Furthermore, J®? J® = diag (1, r(cos? y + rsin? y),r? siny). 

Then ® is an orthogonal, orientation-preserving change of variables mapping 
R' = (0,+00) x [0,7] x (—a,7] U {(0,0,0)} onto R = R®. The interior of R’ 
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\ 


€ 


P’ = (x,y, 0) 


Figure 6.13. Spherical coordinates in space (left); coordinate lines and unit tangent 
vectors (right) 


is A’ = (0,+00) x (0,7) x (—7,7), whose image is in turn the open set A = 
R? \ {(#,0,z) € R?: x < 0, z € R}, as for cylindrical coordinates. 

There are three types of coordinate lines, namely rays from the origin, vertical 
half-circles centred at the origin (the Earth’s meridians), and horizontal circles 
centred on the z-axis (the parallels). The unit tangent vectors at P € R°\{(0,0,0)} 
result from normalising the columns of J® 


t, = (sin ycos 6, sin y sin 0, cos y) , 
t, = (cosy cos 0, cos ysin 0, — sin y) , (6.45) 
tg = (—sin0,cos 0,0). 


They are an orthonormal basis of R? at the point P (Fig. 6.13, right). 


Given a scalar map f(x,y, z), differentiable away from the origin, in spherical 
coordinates : 


f(r,y,0) = f(rsinycos6,rsin ysin 6,r cosy), 
we express Vireo) f =Vfi(a,y,.J® (dropping ~) as 


Or Ox Oz 
Of 
Op Ox 

ORae SO Nas 


ane ~ at ub ese Sar sin pos. 


of = La ee Fe sin sind + OF cea 


O O O 
= I eee cone LSE er win sing 


Oy 
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The inverse relationships read 


OU eg, LE sin 6 
Or e Oy r 00 rsiny 
Chime : Of cospysind Of cosé 
— sin ysin é + ———— _ + — — 

ie Ov f 00 rsiny 


Gi. WOR ee 


Lae ° Fsing 00 °’ 

2 10f, | tangy 1 Of 

ug moe e r Nes etna ool 

tan y Nie i 
00 )tr Gare oo 


Or 


_ ef 208 langof, 1 of 
 Or2 r Or r2 Op r2 sin? p 00? © 


6.7 Regular surfaces 


Surfaces, and in particular compact ones (defined by a compact region R), were 
introduced in Sect. 4.7. 


Definition 6.32 A surface 0 : R > R? is regular if a is C! on A=R 
and the Jacobian matrix Jo has maximal rank (= 2) at every point of A. 


A compact surface is regular if it is the restriction to R of a regular surface 
defined on an open set containing R. 


o 
The condition on Jo is equivalent to the fact that the vectors —(uo, vo) and 


Ou 
) 
Fe (tos vo) are linearly independent for any (uo,vo) € A. By Definition 6.1, such 
v 


vectors form the columns of the Jacobian Jo 


Jo = (= =| | (6.46) 
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Examples 6.33 


i) The surfaces seen in Examples 4.37 i), iii), iv) are regular, as simple calculations 
will show. 


ii) The surface of Example 4.37 ii) is regular if (and only if) y is C' on A. If so, 


1 0 
[ 0 1 
Op Op 
Ou Ov 


whose first two rows grant the matrix rank 2. 
iii) The surface o : R x [0,27] > R°, 
o(u,v) =ucosvit+usinuj +uk 
parametrises the cone 
a? y” —z7=0. 
Its Jacobian is 
cosu— —usinv 
Jo=| sinv ucosv 
1 0 
As the determinants of the first minors are u, usinv, —ucosv, the surface is not 


regular: at points (wo, vo) = (0,vo), mapped to the cone’s apex, all minors are 
singular. 


Before we continue, let us point out the use of terminology. Although Defin- 
ition 4.36 privileges the analytical aspects, the prevailing (by standard practice) 
geometrical viewpoint uses the term surface to mean the trace ©’ C R® as well, 
retaining for the function o the role of parametrisation of the surface. We follow 
the mainstream and adopt this language, with the additional assumption that all 
parametrisations o be simple. In such a way, many subsequent notions will have 
a more immediate, and intuitive, geometrical representation. With this matter 
settled, we may now see a definition. 


Definition 6.34 A subset XY of R® is a regular and simple surface if © 


admits a regular and simple parametrisation 0 :R — 2’. 


Let then ©’ C R® be a regular simple surface parametrised by 0 : R > J, 
° 
and Py) = a(uo, vo) the point on »’ image of (uo, vo) € A = R. Since the vectors 


o 
—(Uuo, V0), ——(uo, Vo) are by hypothesis linearly independent, we introduce the 


Ov 


y+ = (uo, vo) (u — uo) + a (uo, vo) (v — v0) ; (6.47) 
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that parametrises a plane through Po (recall Example 4.37 i)). It is called the 
tangent plane to the surface at Po. Justifying the notation, notice that the 
functions 

Ur> o(U, Vo) and v +> o(uo, v) 


define two regular curves lying on 4’ and passing through Po. Their tangent vectors 


Oo Oo ; 
at Po are FC vo) and By (uo vo); therefore the tangent lines to such curves at 
u Vv 


Po lie on the tangent plane, and actually they span it by linear combination. In 
general, the tangent plane contains the tangent vector to any regular curve passing 
through Po and lying on the surface »’. 


Definition 6.35 The vector 


Oo Oo 
V (uo, vo) — Dy Uo? vo) /\ Fp (uo vo) (6.48) 


is the surface’s normal (vector) at Po. The corresponding unit normal will 


be indicated by n(uo, v0) = Soares 


Due to the surface’s regularity, v(uo,vo) is non-zero. It is also orthogonal to 


Oo Oo 
—(Uuo,vo0) and Foy (Uo vo); so if we compute it at Po it will be orthogonal to 
U 


Ou 
the tangent plane at Po; this explains the name (Fig. 6.14). 


Figure 6.14. Tangent plane J7 and normal vector v at Po 
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Figure 6.15. Normal vector of a surface 


Example 6.36 


i) The regular surface o : R > R3, o(u,v) = uitvj+y(u,v) k, with y € Cl(R) 
(see Example 6.33 ii)), has 


(6.49) 


Note that, at any point, the normal vector points upwards, because v-k > 0 
(Fig. 6.15). 

ii) Let o : [0, x] x [0, 27] + R? be the parametrisation of the ellipsoid with centre 
the origin and semi-axes a,b,c > 0 (see Example 4.37 iv)). Then 


Oo ; : : . 

By ur) = acosucosvi+bcosusinuvj —csinuk, 
u 

Oo ; bes a ' ; 

By (te v) = —asinusinvi+ bsinucosvj + 0k, 


whence 
v(u,v) = besin?ucosvi + acsin?usinvj + absinucosuk. 
If the surface is a sphere of radius r (when a =b=c=r), 
v(u,v) =rsinu(rsinucosvi+rsinusinvj +rcosuk) 


so the normal v at x is proportional to x, and thus aligned with the radial 
vector. 


Just like with the tangent to a curve, one could prove that the tangent plane is 
intrinsic to the surface, in other words it does not depend on the parametrisation. 
As a result the normal’s direction is intrinsic as well, while length and orientation 
vary with the chosen parametrisation. 
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6.7.1 Changing parametrisation 
Suppose »’ is regular, with two parametrisations 7: R— anda: RS. 


Definition 6.37 The parametrisations o and o are congruent if there 
is a change of variables ©: R > R such that ao =a o®. If 3) is a compact 


surface, ® is required_to be the restriction of a change of variables between 
open sets containing R and R. 


Although the definition does not require the parametrisations to be simple, we 
shall assume they are throughout. The property of being regular and simple is 
preserved by congruence. _ 

Bearing in mind the discussion of Sect. 6.6, if A and A are the interiors of R 
and R, then either 


det J@>0O on A or det J <0 on A. 


In the former case & is equivalent to o, while in the latter o is anti-equivalent 
to o. In other terms, an orientation-preserving (orientation-reversing) change of 
variables induces a parametrisation equivalent (anti-equivalent) to the given one. 


Oo 


Oo : : a oat 
The tangent vectors Oa and — can be easily expressed as linear combinations 
U 


a 

Oo Oo : ; : 

of — Pu and ae . As it happens, recalling (6.46) and the chain rule (6.9), and 
u 


omitting to write the point (uo, v0) = &(tio, vo) of differentiation, we have 


(Se S)-(e 
Ou Ov Ou Ov 
setting J® = ae ), these become 
mai mMe2 
Oo Oo Oo Oo Oo 4 Oo 
ai Ou au’ a au ay 
Oo Oo O 
They show, on one hand, that —— and ais generate the same plane of aad and 
3 Ou Ov Ou 
Se as prescribed by (6.47); therefore 
Vv 


the tangent plane to a surface &) is invariant for congruent parametrisations. 


On the other hand, the previous expressions and property (4.8) imply 


Oo 0a Oo Oa Oo 0a 
a ANS FE MUM 2 A = FMI1M22——- AZ + 
Ou Ov 


Ou Ou Ou Ov 
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Oo 0a Oo Oa 
TM21M12-5- /\ a. =F m21M22—— /\ Dv 
Oo Oa 
= (M11M22 — mizma1) A in? 


From these follows that the normal vectors at the point Po satisfy 
vy = (det Jb)v, (6.50) 


which in turn corroborates that 


two congruent parametrisations generate parallel normal vectors; the two ori- 
entations are the same for equivalent parametrisations (orientation-preserving 


change of variables), otherwise they are opposite (orientation-reversing 
change). 


6.7.2 Orientable surfaces 


Proposition 6.27 guarantees two parametrisations of the same regular, simple curve 
I’ of R™ are congruent, hence there are always two orientations to choose from. 
The analogue result for surfaces cannot hold, and we will see counterexamples (the 
Mobius strip, the Klein bottle). Thus the following definition makes sense. 


Definition 6.38 A regular simple surface 7 C R® is said orientable if any 


two regular and simple parametrisations are congruent. 


The name comes from the fact that the surface may be endowed with two ori- 
entations (naively, the crossing directions) depending on the normal vector. All 
equivalent parametrisations will have the same orientation, whilst anti-equivalent 
ones will have opposite orientations. It can be proved that a regular and simple 
surface parametrised by an open plane region FR is always orientable. 

When the regular, simple surface is parametrised by a non-open region, the 
picture becomes more complicated and the surface may, or not, be orientable. With 
the help of the next two examples we shed some light on the matter. Consider first 
the cylindrical strip of radius 1 and height 2, see Fig. 6.16, left. 

We parametrise this compact surface by the map 


o : [0,27] x [-1,1] > R°, o(u,v) =cosut+sinuj + vk, (6.51) 


regular and simple because, for instance, injective on [0, 27) x [—1, 1]. The associ- 
ated normal 
v(u,v) =cosut+sinuj + 0k 
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constantly points outside the cylinder. To nurture visual intuition, we might say 
that a person walking on the surface (staying on the plane xy, for instance) can 
return to the starting point always keeping the normal vector on the same side. 
The surface is orientable, it has two sides — an inside and an outside — and if the 
person wanted to go to the opposite side it would have to cross the top rim v = 1 
or the bottom one v = —1. (The rims in question are the surface’s boundaries, as 
we shall see below.) 

The second example is yet another strip, called Mobius strip, and shown in 
Fig. 6.16, right. To construct it, start from the previous cylinder, cut it along a 
vertical segment, and then glue the ends back together after twisting one by 180°. 
The precise parametrisation o : [0,27] x [-1,1] > R? is 


o(u,v) = (1- 5 008 =) cosut+ (1— 5 08 =) sinuj — ssin Sk. 

For u=0, ie., at the point Po = (1,0,0) = a (0,0), the normal vector is v = sk. 
As u increases, the normal varies with continuity, except that as we go back to 
Po = o(27,0) after a complete turn, we have v = —tk, opposite to the one we 
started with. This means that our imaginary friend starts from Po and after one 
loop returns to the same point (without crossing the rim) upside down! The Mébius 
strip is a non-orientable surface; it has one side only and one boundary: starting 
at any point of the boundary and going around the origin twice, for instance, we 
reach all points of the boundary, in contrast to what happens for the cylinder. 

At any rate the Mobius strip embodies a somehow “pathological” situation. 
Surfaces (compact or not) delimiting elementary solids — spheres, ellipsoid, cylin- 
ders, cones and the like — are all orientable. More generally, 


Proposition 6.39 Every regular simple surface 3) contained in the boundary 


OQ of an open, connected and bounded set Q C R® is orientable. 


We can thus choose for »’ either the orientation from inside towards outside 
(2, or the converse. In the first case the unit normal n points outwards, in the 
second it points inwards. 


Figure 6.16. Cylinder (left) and Mobius strip (right) 
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6.7.3 Boundary of a surface; closed surfaces 


The other important object concerning a surface is the boundary, mentioned above. 
This is a rather delicate notion if one wishes to discuss it in full generality, but we 
shall present the matter in an elementary way. 

Let ©’ Cc R? be a regular and simple surface, which we assume to be a closed 
subset 3’ = ¥7 of R? (in the sense of Definition 4.5). Then let o : R C R*? + ¥ be 


a parametrisation over the closed region R. Calling A = FR the interior of R, the 
image 3° = o(A) is a subset of 3’ = a(R). The difference set 


ee Ae Ta 


will be called boundary of »’ (relative to a). Subsequent examples will show 
that OX’, contains all points that are obvious boundary points of »’ in purely 
geometrical terms, but might also contain points that depend on the chosen para- 
metrisation; this justifies the subscript o. It is thus logical to define the boundary 
of »’ (viewed as an intrinsic object to the surface) as the set 


where the intersection is taken over all possible parametrisations (non-regular as 
well) of 7. We point out that the symbol 02 denoted in Sect. 4.3 the frontier of 
37, seen as subset of R*®; for regular surfaces this set coincides with 3’ = DY, since 
no interior points are present. Our treatise of surfaces will only involve the notion 
of boundary, not of frontier. Here are the promised examples. 


Examples 6.40 

i) The upper hemisphere with centre (0,0,1) and radius 1 

D = {(2,y,z) €R?: 2? +y? + (2-1)? =1,z22>1} 
is a closed set ©’ = XY’ inside R?. Geometrically, it is quite intuitive (Fig. 6.17, 
left) that its boundary is the circle 

Oy ={(2,y,z) ER: 2? +y?+(z-1)? =1,z=1}. 
If one parametrises »’ by 
ao: R={(u,v) €R?:u?4+v0? <1} > R®, o(u,v) = vit+vjt+(14+ Vu? 4 v2)k, 
it follows that 

de = {(2,y,2z) € R?: 2? +y? + (z-1)? =1,z> 1}, 
so 5) \ X’° is precisely the boundary we claimed. Parametrising »’ with spherical 
coordinates, instead, 


G:R= (0, = x [0,27] > R®, G(u,v) =sinucosvi+sinusinvj+(1+cosu)k, 


so A = (0, 5) x (0,27), we obtain X’3 as the portion of surface lying above the 
plane z = 1, without the equatorial arc joining the North pole (0, 0,2) to (1,0, 1) 
on the plane xz (Fig. 6.17, right); analytically, 
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Figure 6.17. The hemisphere of Example 6.40 i) 


Siwy 2 eR sey tel’ =1,2¢> 
\{(z,y,2z) € R?: 2 > 0, y=0, 2? + (2-1)? =1}. 
Besides the points on the geometric boundary, 22 consists of the aforementioned 
arc, which is image of part of the boundary of R; these points depend on the 
particular parametrisation we have chosen. 
ii) Let 
Y= {(ajyy,z) ER? : 2? +y? = 1, |e| <1} 

be the (closed) cylinder of Fig. 6.18, whose boundary 0» consists of the circles 
a? +y? =1,z=+1. Given uo € [0,27], we may parametrise Y by 


Ou, :(0,2n] x [-1,1] > R°, oy, (u,v) = cos(u — up) i+ sin(u — uo) j + vk, 

generalising the parametrisation o = oo seen in (6.51). Then 
vou, = Ut ys 2) € R?: a? +y? =1,|z| <1} 
\{(cos ug, sin uo, z) € R?: |z| < 1}, 
and 02/,,,, contains, apart from the two circles giving 02’, also the vertical 
segment {(cos ug, sin uo, 2) € R® : |z| < 1}. Intersecting all sets 0X, — any two 
of them is enough — gives the geometric boundary of 2’. 
iii) Consider at last the surface 
= {(x, y, 2) ER: (2,y) €T,0<z< ee 

where I" is the Jordan arc in the plane xy defined by 

y:(Y>R*, u@=e(t)=t1—-t) and p(t) = Y(t) = (1—-t)t 
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Figure 6.18. The cylinder relative to Example 6.40 ii) 


(see Fig. 6.19, left). Let us first parametrise ©’ with o : R — R°, where R = 
[0,1]? and o(u,v) = (y(u),v(u),v). It is easy to convince ourselves that the 
boundary 02’, of o is OXg = Ip UI, U So, where Ig =I x {0}, 1) =I x {1} 
and So = {(0,0)} x [0,1] (Fig. 6.19, right). But if we extend the maps y and 
W periodically to the intervals [k,k + 1], we may consider, for any uo € R, the 
parametrisations o,,, : R — R® given by oy,(u,v) = (y(u — uo), Y(u — uo), z), 
for which p= Lo ULL U Sug, Sup = {y(uo)} x [0,1]. We conclude that the 
vertical segment S,,, does depend upon the parametrisation, whereas the top 
and bottom loops I, J are common to all parametrisations. In summary, the 
boundary of »’ is the union of the loops, 0X’ = Ig UT}. oO 


Figure 6.19. The arc I’ and surface /’ of Example 6.40 iii) 
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Among surfaces a relevant role is played by those that are closed. A regular, 
simple surface ’ C R® is closed if it is bounded (as a subset of R*) and has 
no boundary (0X7 = @)). This notion, too, is heavily dependent on the surface’s 
geometry, and differs from the topological closure of Definition 4.5 (namely, each 
closed surface is necessarily a closed subset of R°, but there are topologically- 
closed surfaces with non-empty boundary). Examples of closed surfaces include 
the sphere and the torus, which we introduce now. 


Examples 6.41 


i) Arguing as in Example 6.40 i), we see that the parametrisation of the unit 
sphere Y' = {(2,y,z) € R® : 2? + y? + 2? = 1} by spherical coordinates (Ex- 
ample 4.37 iii)) has a boundary, the semi-circle from the North pole to the South 
pole lying in the half-plane x > 0, y = 0. Any rotation of the coordinate system 
will produce another parametrisation whose boundary is a semi-circle joining 
antipodal points. It is straightforward to see that the boundaries’ intersection is 
empty. 


ii) A torus is the surface »’ built by identifying the top and bottom boundaries 
of the cylinder of Example 6.40 ii). It can also be obtained by a 27-revolution 
around the z-axis of a circle of radius r lying on a plane containing the axis, 
having centre at a point of the plane ry at a distance R+r, R > 0, from the 
axis (see Fig. 6.20). A parametrisation is o : [0,27] x [0,27] > R? with 

a(u,v) = (R+rcosu)cosvi+(R+rcosu)sinvj +rsinuk. 
The boundary 02’, is the union of the circle on z = 0 with centre the origin and 
radius R +r, and the circle on y = 0 centred at (R,0,0) with radius r. 
Here, as well, changing the parametrisation shows the geometric boundary 02 
is empty, making the torus a closed surface. Oo 


The next theorem is the closest kin to the Jordan Curve Theorem 4.33 for 
plane curves. 


Figure 6.20. The torus of Example 6.41 ii) 


6.7 Regular surfaces 247 


Theorem 6.42 A closed orientable surface ) divides the space R® in two 


open regions A;, Ae whose common frontier is 3X’. The region A; (the in- 
terior) is bounded, while the other region Ae (the exterior) is unbounded. 


Nonetheless, there do exist closed non-orientable surfaces, which do not separ- 
ate space in an inside and an outside. A traditional example is the Klein bottle, 
built from the Mobius strip in the same fashion the torus is constructed by glueing 
the cylinder’s two rims. 


6.7.4 Piecewise-regular surfaces 


This generalisation of regularity allows the surface to contain curves where differen- 
tiability fails, and grants us a means of dealing with surfaces delimiting polyhedra 
such as cubes, pyramids, and truncated pyramids, or solids of revolution like trun- 
cated cones or cylinders. As in Sect. refsec:bordo, we shall assume all surfaces are 
closed subsets of R°. 


Definition 6.43 A subset ©’ C R® is a piecewise-regular, simple surface 
if it is the union of finitely many regular, simple surfaces 3,,..., 5) K whose 


pairwise intersections 1,1, (if not empty or a point) are piecewise-regular 
curves I}, contained in the boundary of both X}, and Six. 
The analogue definition holds for piecewise-regular compact surfaces. 


Each surface ¥, is called a face (or component) of +’, and the intersection 
curve between any two faces is an edge of » . 

A piecewise regular simple surface »’ is orientable if every component is ori- 
entable in such a way that adjacent components have compatible orientations. 
Proposition 6.39 then extends to (compact) piecewise-regular, simple surfaces. 

The boundary 02’ of a piecewise-regular simple surface »’ is the closure of 
the set of points belonging to the boundary of one, and one only, component. 
Equivalently, consider the union of the single components’ boundaries, without 
the points common to two or more components, and then take the closure of this 
set: this is 0X’. A piecewise-regular simple surface is closed if bounded and with 
empty boundary. 


Examples 6.44 


i) The frontier of a cube is piecewise regular, and has the cube’s six faces as 
components; it is obviously orientable and closed (Fig. 6.21, left). 


ii) The piecewise-regular surface obtained from the previous one by taking away 
two opposite faces (Fig. 6.21, right) is still orientable, but no longer closed. Its 
boundary is the union of the boundaries of the faces removed. O 
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Figure 6.21. The cubes of Examples 6.44 


Remark 6.45 Example ii) provides us with the excuse for explaining why one 
should insist on the closure of the boundary, instead of taking the mere boundary. 
Consider a cube aligned with the Cartesian axes, from which we have removed the 
top and bottom faces. The union of the boundaries of the 4 faces left is given by 
the 12 edges forming the skeleton. Getting rid of the points common to two faces 
leaves us with the 4 horizontal top edges and the 4 bottom ones, without the 8 
vertices. Closing what is left recovers the 8 missing vertices, so the boundary is 
precisely the union of the 8 horizontal edges. oO 


6.8 Exercises 


1. Find the Jacobian matrix of the following functions: 


F(x,y) = e444 + cose + 2y) 5 


f (x,y,z) = (a + 2y? +323) é4 (x + sin 3y + e”) 9 
2. Determine the divergence of the following vector fields: 

f(x,y) = cos(a + 2y) i+ e?**Y j 

f(z,y,z) =(@t+ytz)it(e?tyt27)j+ (ety +23)k 
3. Compute the curl of the vector fields: 

a) f(z,y,z)=xityjticzk 


b) f(z,y,z) =sxyzit+zsinyj+taze%k 
Yy ‘ x ; 
f(x,y) = grad (log, y)” 
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Given f(x,y) = 3x4 2y and g(u,v) = (u+v)i+uvg, compute the composite 
map f og explicitly and find its gradient. 


5. Compute the composite of the following pairs of maps and the composite’s 
gradient: 


f(s,t)=Vert, g(x,y) =2yit = 5 


72) Saye g(r, s,t) =(r+s)t+(r4+3t)j+(s—t)k 


Let f : R? + R® and g: R® + R? be defined by 
f(x,y) =sin(Qx+y)ite*t4 7, g(u,v,z) = (ut 2u2 +329) it (u? —2v) 7. 


a) Compute their Jacobians Jf and Jg. 
b) Determine the composite h = f og and its Jacobian Jh at the point 
(1, —1, 0). 


7. Given the following maps, determine the first derivative and the monotonicity 
on the interval indicated: 


t 
“ y= fay, ¢eli 458) 
V1i-x£ x 
y= [ yy/8+y4 ~ Say, € [0,1] 


8. Compute the length of the arcs: 


y(t) = (t, 3t), t€ (0,1 
y(t) = (tcost,tsint,t), ¢ € [0,27] 
ct) a=). t € [0,1] 


Determine the values of a € R for which the length of y(t) = (t, at?,t®), t € 
(0, 7], equals £(y) =T+T3. 


Consider the closed arc y whose trace is the union of the segment between 
A = (—log2,1/2) and B = (1,0), the circular arc x? + y? = 1 joining B to 
C = (0,1), and the arc y3(t) = (t,e*) from C to A. Compute its length. 


11. Tell whether the following arcs are closed, regular or piecewise regular: 
+(t) = (t(t — 7)?(t — 2nt), cost),  t € [0,27] 
y(t) = (sin? 2rt,t) , t € [0,1] 
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What are the values of the parameter a € R for which the curve 


(at, t?, 0) ift<0, 
WES ny a ae. Ss 
(sina*t,0,t?) ift>0 


is regular? 


Calculate the principal normal vector n(s), the binormal b(s) and the torsion 
b’(s) for the circular helix y(t) = (cost, sint,t), t € R. 


If a curve y(t) = x(t)t + y(t)j is (r(t), O(t)) in polar coordinates, verify that 


I = (SF) + (re). 


15. Check that if a curve y(t) = x(t)i + y(t)j + z(t)k is given by (r(t), 0(t), z(t) 
in cylindrical coordinates, then 


or = (LS) +(r@Z) + (FZ). 


Check that the curve y(t) = «(t)t+y(t)j+<z(t)k, given in spherical coordinates 
by (r(t), y(t), 0(t)), satisfies 


ol? = (SL) + @@)?(F) + C@sine)?(F 


Using Exercise 15, compute the length of the arc y(t) given in cylindrical 
coordinates by 


1 : 1 
40) = (7G), 00), 20) = (v2cost, satssint) ; te (0, a 


Consider the parametric surface 
o(u,v) = uvit (1+ 3u)j + (v2 +2u)k. 


a) Tell whether it is simple. 

b) Determine the set R on which o is regular. 

c) Determine the normal to the surface at every point of R. 

d) Write the equation of the plane tangent to the surface at Py = o(uo, v0) = 
(iA 3). 
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The surfaces 


oi (u,v) = cos(2 — u)i+sin(2—u)j+v°k, (u,v) € [0,27] x [0,1] 
and 
o2(u,v) = sin(3 + 2u)é+ cos(3 + 2u)7 + (1—v)k, (u,v) € [0, x] x [0,1] 


parametrise the same set X) in R®. 
a) Determine +. 
b) Say whether the orientations defined by 0, and @»2 coincide or not. 


c) Compute the unit normals to »’, at Po = (0,1, +), relative to a7, and a2. 


Consider the surface 0 : R? > R°, 
o(u,v) =uitvgt+(u?+3uv+v")k. 


a) What is the unit normal n(u, v) ? 
b) Determine the points on the image ©) at which the normal is orthogonal to 
the plane 8a + Ty — 2z = 4. 


6.8.1 Solutions 


1. Jacobian matrices: 


a) Jf(x,y) = ( 


Je2ety e2tty 
—sin(x+2y) —2sin(x + 2y) ) 


i 4 92? 
b) It e.n2)=({ tcp ) 


2. Vector fields’ divergence: 
a) div f(a, y) = —sin(x + 2y) + e2°t¥ 
b) div f(z,y,z) =1+2y+32? 


3. Curl of vector fields: 


a) curl f(z,y,z)=0 

b) curl f(a, y, z) = (ze¥ — siny)i— (e¥ —azy) jg —azk 
_ 2xry 

: curl f(s, y) = (@? + 2372 


d) Since f is a gradient, Proposition 6.8 ensures its curl is null. 
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4. We have 


fog(u,v) = f(g(u,v)) = f(utv, wv) = 3(ut v) + 2uv 


and 
Vf og(u,v) = (3 + 2v,34 2u). 


5. Composite maps and gradients: 


a) We have 
fog(z,y) = f(g(z,y)) = f(xy, ;) =, ap ' 


vicgteu) = (5) (u+2). 5/2 (2-5). 


b) Since 
f(g(r, s,t)) = f(r +s,r + 3t,s—t) = (rts) (r+3t)(s—2), 


and 


setting h = f og gives the gradient’s components 


ae = (r + 3t)(s—t) + (r+) (s—t) = 2rs + 2st — 2rt+ s? — 3t? 
0 
- 
a (rs yt) = (r+ Bt) (st) + (r+) (r+ Bt) = ro + Gat + Ort — 31? +9? 
Oh 
Fy ("81 t) = 3(r + 8) (8 —t) — (7 +8) (7 + 3t) = Ars — Grt — 6st + 38° — 7°. 
6. a) We have 
_ (2cos(2x+y) cos(2x + y) _f{ 1 4u 92? 
#29) = ( ett2y Jerr2y ) Jg(u,v,z) = Pu —2 0 ® 
b) Since 


h(u,v,z) = f(ut+ Qu? + 323, u? — 2v) 
= sin(u? + Ay? + 622 + 2u— 2v) it e2t? +207 +329 +u—du 5 
and g(1,—1,0) = (3,3), it follows that 


Jh(1, -1,0) = Jf (g(1, -1,0)) Jg(1, -1, 0) = Jf (3, 3) Jg(1, -1,0) 


_ {2cos9 cos9 1 —4 0)\ _ /4cos9 —10cos9 0 
~ \ ee? 2e° 2 -2 0) \ 5e? —8e 0)’ 
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7. Derivatives of integral functions: 


a) As 
t 3 
f'(@) = ——— >9, Ve eR, 
the map is (monotone) increasing on [1, +00). 

b) Since 

Ae ae = 1 
f'(2) -z | y 8494-2) ay— 29/84 0—2p—2 
6 2 2 
1lsl Vie 4 3/2 P 
mst 52 ao 

seal y(8 + ) dy + 4/a =2 +9), 


we obtain f’(z) <0 for any x € [0,1]; hence f is decreasing on [0, 1]. 


8. Length of arcs: 
We shall make use of the following indefinite integral (Vol. I, Example 9.13 v)): 


1 1 
[ vivP ae = 52 1+ a? + 5 log( Lea a) ee. 


a) From 
+'(t) =(1,6t) and ly (é)|| = V1 + 36¢? 


follows 


1 1 6 
a= | Viv a6e at == | V1l+a7dzxr 
0 0 


1|1 1 
— = lla /1 42? + <log(/1 + 2? 
age +2 + 5 loal +2 +2)| 


1 1 


6 


0 


b) Since 
4'(t) = (cost —tsint,sint+tcost,1) and — |/7()|| = V24+ 2, 


we have 


27 Vr 
f(y) = V2+tdt=2 V1+4+ 27 dx 

0 0 
Vir 


1 1 
=2|50 1+ a? + = log( ive +2) 
D D : 


= J2nV/1+ 2n? + log (1 + 2x? +V2r). 
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c) £(y) = #(17V17 - 1672). 
9. As ¥/(t) = (1, 2at, 3t7), we must have 
T 
(yy) =T+T? = ; JT +402? + 91 dt. 
0 
Set g(T) = 1 +T?: the Fundamental Theorem of Integral Calculus yields 


g (T) =1+8T? = V/1+40?T2 + 974, 


Leu, 
(1+ 377)? =1+4a0°T? + 9T*. 
From this, 4a? = 6, so a = +\/3/2. 
10. We have 
L(y) = (m1) + L(y2) + L(y) 
where 
(1) = (t ad ) t € [—log2,1] 

MNT A 9+ Tog 2)’ Bohs 

o(t) = (cost, sin t) , t € [0, 5 

y(t) =(t,e"), + € [- log 2,0]. 


From elementary geometry we know that 


iL 5 
n) = d( A,B) = (1 + log 2)? + 7 = / 7 t+ 2log2 + log’ 2, 


20 


TT 
(a) = 5+ 


For £(-73), observe that -y4(t) = (1,e"), so 


0 
tos) = Ver eet: 
Sipe 


Now, setting u = V1+ e?, we obtain e?’ = u? —1 and du = + dt = wat dt. 
Therefore 


v2 2 v2 
u 1/2 1/2 
~ = 1 apres ey 
l(y3) / 5 du Hak + rea] ——) du 


J2 
U V5/2 2 
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To sum up, 


ty) = + [2 + 2log2 + log? 2+ VI — 2 + log(v3- 1)(vE +2). 


11. Closed, regular, piecewise-regular arcs: 


a) The arc is closed because (0) = (0,1) = y(27). 
Moreover, -y/(t) = (2(¢ — m)(2t? — 4nt + 1”), — sint) implies that -y’(£) is C* on 
[0, 27]. Next, notice sint = 0 only when t = 0, 7,27, and (0) = (—273, 0) £0, 
(27) = (—1073,0) 4 0, +'(7) = 0. The only non-regular point is thus the 
one corresponding to t = 7. Altogether, the arc is piecewise regular. 

b) The fact that (0) = (0,0) and y(1) = (0,1) implies the arc is not closed. 
What is more, 


4'(t) = (Amsin 2rt cos 2nt, 1) 4 (0,0), Ve € [0,1], 


making the arc regular. 


12. Note + is continuous for any t € R with 


? (a, 3¢?, 0) ines 
YO)=4/ 2 ‘ eos 
(a* cos a*t,0,3t*) ift>0. 


Certainly +/(t) 4 0 for t 4 0 and any a € R. To study the point t = 0, observe 
that 


7 (0) = lim 7(¢) = (@,0,0), -¥/(0F) = lim _7/(¢) = (a7, 0,0), 


t>07- t>0+ 


so 7'(0) exists if a= a7, ie., if a =0 or a=1. Ifa =0 then (0) = 0, while for 
a = 1 we have 7'(0) = a. . 0) £0. In conclusion, the only value of regularity is 
a=1. 


13. Bearing in mind Example 6.29, we may re-parametrise the helix by arc length: 


ys) = (cos sin Zs, e. \ seER. 
For any s € R then, 
is) =7 (= (- ae 2) ue cos 2, _) 
2 2 2 2 2 
and ‘ JB ; i 
i(s)= (- 58 85 sin “=s,0) 


with curvature K(s) = ||t’(s)|| = 1/2. Therefore 
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V2 VP 
nis) = (- cos —S, — sin —s, 0) 
Z 2 
= v2. vB v3_ v2 
2 2 2 2 Z 
b(s) = t(s) /\ n(s) = ee sin ere ys COS re =) : 
At last, 
iL 2 1 2 
b’(t) = (5 cos en 5 sin ve 0) 
and the scalar torsion is t(s) = —4. 
14. Remembering that 
t=7 cose , v=rsneg, 
and using the chain rule, 
dx Oxdr  Oxdé dr dé 
, = eT —_ —_ = A — 1 7— 
a ode ode ae 
dy Oydr  OdOydd x lt dé 
'd) = = = SS 4+ SS — = sin J — O—. 
Uae Orde” ae a 


The required equation follows by using sin? @ + cos?@ = 1, for any 0 € 


2 


+ (y'(t))”. 


2 


IYI? = (2) 
16. Recalling 


x=rsinycosé, y=rsingsin6, £=Tr cosy, 
and the chain rule, we have 

(a) =o dear, Ae dy , x do 
ON) at Or dt | dy dt | 00 dt 

. dr dy . . dé 

= sin pcos 0—- + rcos pcos — rsinysind 
yi = 2 = 2uar , Oude, dy a0 
dt Ordt Opydt o6ddt 

' ae . ,dyp dé 

= sin psing-— + rcospsin @— + rsing cos 0 
ap) = 82 22a , Oz dy | dead 
~ dt Ordt Oydt O6dt 


dr . dy 
= csp, —rsing—-. 


Now using sin? € + cos? € = 1, for any € € 


R, and ||7‘(4)||? = (x"(¢)) 


ey (t))”, a little computation gives the result. 


2 


+ (y'(t)) 


R, and 


+ 
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17. Since 
r(t)=—-V2sint,0 ‘(t)= 


we have 


m/2 I x /2 5 
«=f 2sin?t + (cos?) 5 + cos? at = v5 f a= Pr. 
2 0 


Notice the arc’s trace lies on the surface of the ellipsoid x? + y? + 2z2 = 2. The 
initial point is (V2,0,0), the end point (0,0,1) (see Fig. 6.22). 


18. a) The surface is simple if, for (ui, v1), (wa, v2) € R?, we have 


o(u1, U1) = o (ua, V2) > (u1, V1) = (ua, V2) : 


U1VU1 = U2V2 
{+8 =1+3u2 


v? + Qu, = v3 + Que. 


The left equation means 


From the middle line we obtain u, = ug, and then v1 = v2 by substitution. 
b) Consider the Jacobian 


v 4U 
Jo—=|3 O 
2 3v? 


The three minors’ determinants are —3u, 9v?, 3v? — 2u. The only point at which 
they all vanish is the origin. Thus o is regular on R = R? \ {(0,0)}. 


¢) For (u,v) #0, 


Ss. 


k 
v(u,v) = det 2 | =9v74+ (Qu —3v) 7 — 3uk. 


J 
v 3 
0 


Figure 6.22. The arc of Exercise 17 
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d) Imposing 
uv = 1, 1+3u=4, vi +2u=3 


shows the point Po = (1,4,3) is the image under o of (ug, vo) = (1,1). Therefore 
the tangent plane in question is 


Oo Oo 
= 1)+ —(1,1 —1)+—(1,1 —1 
M(u,v) = o(1,1) + 20) w=) + ZY w=1) 
= (1,4,3) + (1,3, 2)(u— 1) + (1,0, 3)(u — 1) 
=(utv—1)i+ But+1j4+ Qut+ 3v—-2)k. 
Using Cartesian coordinates x =u+vu—1, y= 3u+1, z = 2u+4 3v — 2, the plane 
reads 92 — y— 8z2+4=0. 


19. a) As u varies in [0,27], the vector cos(2 — u)4+ sin(2 — u)j runs along the 
unit circle in the plane z = 0, while for v in [0,1], the vector vk describes the 
segment [0,1] on the z-axis. Consequently 7 is a cylinder of height 1 with axis on 
the z’s. 


b) Using formula (6.48) the normal vector defined by oj is 
V1(u,v) = —2vcos(2 — u)t — 2vsin(2 — u)j, 
while the normal of o» is 
V2(u,v) = 2sin(3 + 2u)7+ 2cos(3 + 2u)j. 
If P = o(u1, v1) = 2(u2, v2) is an arbitrary point on 2’, then 
cos(2 — ui) = sin(3+2u2) and sin(2 — ui) = cos(3 + 2u2), 


and so 
V1 (U1, V1) = —V1V2(ua2, v2) . 


Since v1 is non-negative, the orientations are opposite. 
c) We have Po = o (2-4, 3) = a (n—#, 3). Hence 11 (Po) = —J, while v2(Po) = 27. 
The unit normals then are n(Po) = —j and n2(Po) = 7. 


20. a) Expression (6.49) yields 
v(u,v) = —(2u+ 3v)t — (8u+ 2u)g+k, 
normalising which produces 


2 3 3 2 1 
i re 
rl al al 


with ||v||? = 13(u? + v7) + 24uv + 1. 


6.8 Exercises 259 


b) The orthogonality follows by imposing v be parallel to the vector 84 + 77 — 2k. 
Thus we impose 


2u + 3v) = 8X 
3u + 2v) = 7X 
L=—2A; 
Solving the system gives \ = —1/2 and u = 1/2, v = 1. The required condition is 


valid for the point Pp = 0 (3,1) = ($,1,44). 


—~ 


7 


Applying differential calculus 


We conclude with this chapter the treatise of differential calculus for multivariate 
and vector-values functions. Two are the themes of concern: the Implicit Function 
Theorem with its applications, and the study of constrained extrema. 


Given an equation in two or more independent variables, the Implicit Function 
Theorem provides sufficient conditions to express one variable in terms of the 
others. It helps to examine the nature of the level sets of a function, which, under 
regularity assumptions, are curves and surfaces in space. Finally, it furnishes the 
tools for studying the locus defined by a system of equations. 


Constrained extrema, in other words extremum points of a map restricted to 
a given subset of the domain, can be approached in two ways. The first method, 
called parametric, reduces the problem to understading unconstrained extrema 
in lower dimension; the other geometrically-rooted method, relying on Lagrange 
multipliers, studies the stationary points of a new function, the Lagrangian of the 
problem. 


7.1 Implicit Function Theorem 


An equation of type 


sets up an implicit relationship between the variables x and y; in geometrical terms 
it defines the locus of points P = (x,y) in the plane whose coordinates satisfy the 
equation. Very often it is possible to write one variable as a function of the other 
at least locally (in the neighbourhood of one solution), as y = y(x) or x = wW(y). 
For example, the equation 


f(x,y) =ax+by+c=0, with a?+b? <0, 
defines a line on the plane, and is equivalent to y = p(x) = —F(ax +c), ifb#0, 
or © = (y) = —4(by +c) ifa £0. 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_7, 
© Springer International Publishing Switzerland 2015 
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Instead, 
f(z,y) =e? +y%—1r? =0, with r>0 


defines a circle centred at the origin with radius r; if (xo, yo) belongs to the circle 
and yo > 0, we may solve the equation for y on a neighbourhood of 29, and 
write y = p(x) = Vr? — x; if xo < 0, we can write x = W(y) = —\/r? —y? ona 
neighbourhood of yo. It is not possible to write y in terms of x on a neighbourhood 
of x) =r or —r, nor x as a function of y around yo = r or —r, unless we violate 
the definition of function itself. But in any of the cases where solving explicitly 
for one variable is possible, one partial derivative of f is non-zero (34 =b2 0, 
of = a # 0 for the line, af = 2y9 > 0, gf = 2x9 < 0 for the circle); conversely, 
where one variable cannot be made a function of the other the partial derivative 
is zero. 

Moving on to more elaborate situations, we must distinguish between the ex- 
istence of a map between the independent variables expressing (7.1) in an explicit 
way, and the possibility of representing such map in terms of known elementary 
functions. For example, in 


ye"* + cosy —2=0 
we can solve for x explicitly, 
1 1 
= — log(2 — | 
T= og(2 — cos y) 5 logy. 
but we are not able to do the same with 
a + 3a7y —2y*-1=0. 


Nonetheless, analysing the graph of the function, e.g., around the solution (xo, yo) = 
(1,0), shows that the equation defines y in terms of x, and the study we are about 
to embark on will confirm rigorously the claim (see Example 7.2). Thus there is 
a huge interest in finding criteria that ensure a function y = y(x), say, at least 
exists, even if the variable y cannot be written in an analytically-explicit way in 
terms of x. This lays a solid groundwork for the employ of numerical methods. We 
discuss below a milestone result in Mathematics, the Implicit Function Theorem, 
also known as Dini’s Theorem in Italy, which yields a sufficient condition for the 
function y(x) to exist; the condition is the non-vanishing of the partial derivative 
of f in the variable we are solving for. 

All these considerations evidently apply to equations with three or more vari- 
ables, like 


peru) — Oe ee 


Under special assumptions, we are allowed to define z = y(x,y) as a function of x 
and y. A second generalisation concerns systems of equations. 

Numerous are the applications of the Implicit Function Theorem. To name a 
few, many laws of Physics (think of Thermodynamics) tie two or more quantities 
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through implicit relationships which, depending on the system’s specific state, can 
be made explicit. In geometry the Theorem allows to describe as a regular simple 
surface, if locally, the locus of points whose coordinates satisfy (7.2); it lies at the 
heart of an ‘intrinsic’ viewpoint that regards a surface as a set of points defined 
by algebraic and transcendental equations. 

So let us begin to see the two-dimensional version. Its proof may be found in 
Appendix A.1.4, p. 518. 


Theorem 7.1 Let Q be a non-empty open set in R? and f : Q— R aC! map. 


Assume at the point (a0, yo) € 2 we have f(x, yo) = 0. If FE (0,0) # 0, 


there exists a neighbourhood I of x9 and a function py: I + R such that: 
i) (a,p(x)) €Q for anya € I; 

tt) yo = p(Xo); 

iti) f (2,p (z)) =0 for any z € I; 

iv) p is aC! map on I with derivative 


(7.3) 


On a neighbourhood of (x0, yo) moreover, the zero set of f coincides with the 
graph of vp. 


We remark that «> (x) = (x,y (z)) is a curve in the plane whose trace is 
precisely the graph of yp. 


Example 7.2 
Consider the equation 
a + 3a7y — 2y*=1, 
already considered in the introduction. It is solved by (xo, yo) = (1,0). To study 


the existence of other solutions in the neighbourhood of that point we define the 
function 
a: 2 4 
f(a,y) = 2° + 8a*y — 2y* —-1. 
We have f,(z,y) = 3x7 — 8y?, hence f,(1,0) = 3 > 0. The above theorem 
guarantees there is a function y = v(x), differentiable on a neighbourhood J of 
xo = 1, such that y(1) = 0 and 
x + 3x70(x) — 2(o(x))* = 1, eel. 

From fx(x,y) = 5x24 + 6ry we get fr(1,0) = 5, so y’/(1) = —8. The map y is 


3 
decreasing at 29 = 1, where the tangent to its graph is y = —3(a —1). 
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a) 
In case FL (a, yo) # 0, we arrive at a similar result to Theorem 7.1 in which 
a 


x and y are swapped. To summarise (see Fig. 7.1), 


Corollary 7.3 Let Q be a non-empty open set in R? and f: 27> RaC! 
map. The equation f(x,y) = 0 can be solved for one variable, y = p(x) or 


x =w(y), around any zero (x0, yo) of f at which Vf (x0, yo) £0 (i.e., around 
each regular point). 


There remains to understand what is the true structure of the zero set in the 
neighbourhood of a stationary point. Assuming f is C?, the previous chapter’s 
study of the Hessian matrix Hf(xo, yo) provides useful information. 


i) If Hf (xo, yo) is definite (positive or negative), (xo, yo) is a local strict minimum 
or maximum; therefore f will be strictly positive or negative on a whole punctured 
neighbourhood of (29, yo), making (29, yo) an isolated zero for f. An example is 
the origin for the map f(x,y) = 27 +y?. 


ii) If Hf (xo, yo) is indefinite, i.e., it has eigenvalues A; > 0 and Ag < 0, one can 
prove that the zero set of f around (20, yo) is not the graph of a function, because it 
consists of two distinct curves that meet at (xo, yo) and form an angle that depends 
upon the eigenvalues’ ratio. For instance, the zeroes of f(a, y) = 4x? — 25y? belong 


to the lines y = +2x, crossing at the origin (Fig. 7.2, left). 


Figure 7.1. Some intervals where f(x,y) = 0 can be solved for x or y 
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YY WY YY 


y = 2a y= S74 


Figure 7.2. The zero sets of f(x,y) = 4x7 — 25y? (left), f(a,y) = «* — y® (centre) and 
f(x,y) = x* — 16y” (right) 


iii) If Hf(xo,yo) has one or both eigenvalues equal zero, nothing can be said in 
general. For example, assuming the origin as point (0, yo), the zeroes of f(x,y) = 
a — y® coincide with the graph of y = Vx* (Fig. 7.2, centre). The map f(x,y) = 
z+ — 16y?, instead, vanishes along the parabolas y = +42? that meet at the origin 
(Fig. 7.2, right). 


We state the Implicit Function Theorem for functions in three variables. Its 
proof is easily obtained by adapting the one given in two dimensions. 


Theorem 7.4 Let 2 be a non-empty open subset of R® and f: 2+ Ra 
0 

map of class C! with a zero at (xo, yo, 20) € Q. If FE (x0, 0520) # 0, there 

exist a neighbourhood A of (x9, yo) and a function p: AR such that: 


i) (x,y, (x, y)) € 2 for any (x,y) € A; 

ti) 20 = Y(Zo, yo); 

iti) f (a, Y,~ Ga) = 0 for any (x,y) € A; 
iv) p is C! on A, with partial derivatives 


6) 
a (x, yy (x, y)) S(c, ye (2,4) 
io aa Se. 5, \e 2 ar (7.4) 
By (tye (x, y)) y(t YUP (Ge) 


OB 


Moreover, on a neighbourhood of (xo, yo, 20) the zero set of f coincides with 
the graph of vp. 


The function (z,y) + o(2,y, 2) = (x,y, (Gs y)) defines a regular simple surface 
in space, whose properties will be examined in Sect. 7.2. 

The result may be applied, as happened above, under the assumption the point 
(xo, yo, Zo) be regular, possibly after swapping the independent variables’ roles. 


These two theorems are in fact subsumed by a general statement on vector- 
valued maps. 
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Let n, m be integers with n > 2 and 1 <m <n-—1. Take an open, non-empty 
set Q in R” and let F : Q — R™ be aC! map on it. Select m of the n independent 
variables 11, %2,...,£n; without loss of generality we can take the last m variables 
(always possible by permuting the variables). Then write a as two sets of coordin- 
ates (€, 4), where € = (71,...,2n—m) € R"”™ and pb = (fn—m41,---;2n) € R™. 
Correspondingly, decompose the Jacobian of F at x € (2 into the matrix J¢ F(x) 
with m rows and n—m columns containing the first n —m columns of JF (a), and 
the m x m matrix J, F(a) formed by the remaining m columns of JF (a). 

We are ready to solve locally the implicit equation F(a) = 0, now reading 
F(€, 4) = 0, and obtain p = ®(€). 


Theorem 7.5 (Implicit Function Theorem, or Dini’s Theorem) With 
the above conventions, let xo = (0, Ho) € 2 be a point such that F (ao) = 0. 
If the matric J,,F (ao) is non-singular there exist neighbourhoods A C 22 of 
xo, and I of €) in R"-™, such that the zero set 

{x EA: F(x) =0} 


of F coincides with the graph 


{x = (€,B(€)) :€ € I} 


of aC! map ® : I > R™ satisfying ®(€) = wo. On I, the Jacobian of ® 
(with m rows and n—m columns) is the solution of the linear system 


IF (€, ®(£)) JB(E) = —IeF (E, B(E)). (7.5) 


Theorems 7.1 and 7.4 are special instances of the above, as can be seen taking 
n=2,m=1, and n= 3, m=1. The next example elucidates yet another case of 
interest. 


Example 7.6 
Consider the system of two equations in three variables 
{ filz,y,z) =0 
fala, y,2)=0 


where f; are C' on an open set 2 in R3, and call xp = (x9, yo, 20) € 2 a solution. 
Specialising the theorem with n = 3, m = 2 and F = (fi, fz) ensures that if 


O O 
Heo) Feo) 
O 6) 
(a0) Fe) 


is not singular, the system admits infinitely many solutions around ao; these 
are of the form (x,y 1(2%), yo(x)), where ® = (1,2): 1 CR—R? isC' ona 
neighbourhood I of xo. 
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The components of the first derivative ®’ at x € I solve the system 


Ofi Ofi 


By PO) ex) Bz (vale) ea(e)) \ (ei@)\ 
ee) 3B (esp (0), 92(2)) P2(2) 
(FE levi@,v2(0) 
| Ofs 


hae: (xp ie), ~2 (x)) 


The function x + +(x) = (x,p1(x), 2(x)) represents a (regular) curve in space. 


7.1.1 Local invertibility of a function 


One application is the following: if the Jacobian of a regular map of R” is non- 
singular, then the function is invertible around the point. 

This generalises to multivariable calculus a property of functions of one real 
variable, namely: if f is C! around zo € domf C R and if f’(x9) # 0, then 
f’ has constant sign around 29; as such it is strictly monotone, hence invertible; 
furthermore, f~' is differentiable and (f~')/(yo) = 1/f' (ao). 


Proposition 7.7 Let f : domf C R” — R” be C! around a point a € 
dom f. If Jf(xo) is non-singular, the inverse function x = f—'(y) is well 
defined on a neighbourhood of yo = f (xo), of class C', and satisfies 


I(f-*)(yo) = (If(ao)) (7.6) 


Proof. Define the auxiliary map F : domf x R” C R2” > R” by F(a,y) = 
f(x) — y. Then F(ao,yo) = O and JF (xo, yo) = (Jf(ao), 7) (an 
n X 2n matrix). Since J, F (x0, yo) = Jf(xo), Theorem 7.5 guarantees 
there is a neighbourhood B,(yo) and a C! map g : B,(yo) ~ domf 
such that F(g(y),y) = 0, ie., f(g(y)) = y, Vy € Br(yo). Therefore 
g(y) = f~*(y), so (7.6) follows from (7.5). 


The formula for the Jacobian of the inverse function was obtained in Corollary 6.16 
as a consequence of the chain rule; that corollary’s assumptions are eventually 
substantiated by the previous proposition. 


What we have seen can be read in terms of a system of equations. Let us 
interpret 

f(x) =y (7.7) 

as a (non-linear) system of n equations in n unknowns, where the right-hand side 

y is given and « is the solution. Suppose, for a given datum yo, that we know a 
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solution ao. If the Jacobian Jf (ao) is non-singular, the equation (7.7) admits one, 
and only one solution x in the proximity of xo, for any y sufficiently close to yo. 
In turn, the Jacobian’s invertibility at xo is equivalent to the fact that the linear 
system 

f (xo) + Jf (xo)(w — xo) = y, (7.8) 


obtained by linearising the left-hand side of (7.7) around ao, admits a solution 
whichever y € R” is taken (it suffices to write the system as Jf(xao)x = y — 
f (xo) + Jf(xo)xo and notice the right-hand side assumes any value of R” as y 
varies in R”). 

In conclusion, the knowledge of one solution ao of a non-linear equation for a 
given datum yo, provided the linearised equation around that point can be solved, 
guarantees the solvability of the non-linear equation for any value of the datum 
that is sufficiently close to yo. 


Example 7.8 


The system of non-linear equations 

oe 

3a? + xy? = b 

admits (x0, yo) = (1,2) as solution for (a,b) = (5,7). Calling f(x,y) = (#2 + 
y® — 2xy, 3x” + xy”), we have 

7 3x22 —2Qy 3y? — 2x 
As 
10 4 
—1 10 


we can say the system admits a unique solution (z, y) for any choice of a, b close 
enough to 5,7 respectively. 


det Jf (1,2) = det ( ) =104>0, 


7.2 Level curves and level surfaces 


At this juncture we can resume level sets L(f,c) of a given function f, seen 
in (4.20). The differential calculus developed so far allows us to describe such 
sets at least in the case of suitably regular functions. The study of level sets offers 
useful informations on the function’s behaviour, as the constant c varies. At the 
same time, solving an equation like 


fy)=0, or f(x,y,z) =0, 


is, as a matter of fact, analogous to determining the level set L(f,0) of f. Un- 
der suitable assumptions, as we know, implicit relations of this type among the 
independent variables generate regular and simple curves, or surfaces, in space. 

In the sequel we shall consider only two and three dimensions, beginning from 
the former situation. 
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7.2.1 Level curves 


Let then f be a real map of two real variables; given c € im f, denote by L(f,c) the 
corresponding level set and let ao € L(f,c). Suppose f is C' on a neighbourhood 
of the regular point xo, so that Vf(ao) 4 0. Corollary 7.3 applies to the map 
f(a) —c to say that on a certain neighbourhood B(ao) of xo the points of L(f, c) 
lie on a regular simple curve y : J — B(ao) (a graph in one of the variables); 
moreover, the point tg € J such that (to) = ao is in the interior of J. In other 
terms, 


L(f,c) 0 B(ao) = {a € R?: w@ = x(t), with te I}, 


whence 
fliv@)=« Wel. (7.9) 


If the argument is valid for any point of L(f,c), hence if f is C' on an open set 
containing L(f,c) and all points of L(f,c) are regular, the level set is made of 
traces of regular, simple curves. If so, one speaks of level curves of f. The next 
property is particularly important (see Fig. 7.3). 


Proposition 7.9 The gradient of a map is orthogonal to level curves at any 


of their regular points. 


Proof. Differentiating (7.9) at to € I (recall (6.13)) gives 
Vf (xo) -¥'(t) = 9, 


and the result follows. Oo 


Moving along a level curve from a maintains the function constant, whereas in 
the perpendicular direction the function undergoes the maximum variation (by 
Proposition 5.10). It can turn out useful to remark that curlf, as of (6.7), is 
always orthogonal to the gradient, so it is tangent to the level curves of f. 


Figure 7.3. Level curves and gradient of a map 
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Figure 7.4. The relationship between graph and level sets 


Fig. 7.4 shows the relationship between level curves and the graph of a map. 
Recall that the level curves defined by f(x,y) = ¢ are the projections on the ry- 
plane of the intersection between the graph of f and the planes z = c. In this 
way, if we vary c by a fixed increment and plot the corresponding level curves, the 


latter will be closer to each other where the graph is “steeper”, and sparser where 
it “flattens out”. 


U4 


Figure 7.5. Level curves of f(x,y) = 9 — x? — y? 
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Figure 7.6. Graph and level curves of f(x,y) = —axye * -¥ (top), and f(z,y) = 
Ses (bottom) 


Examples 7.10 
i) The graph of f(x,y) = \/9 — 2? — y? was shown in Fig. 4.8. The level curves 


are 
V9I- 2? —y2=c Le., g+y?=9-c?. 


This is a family of circles with centre the origin and radii V9 — c?, see Fig. 7.5. 


ii) Fig. 7.6 shows some level curves and the corresponding graph. 


Patently, a level set may contain non-regular points of f, hence stationary 
points. In such a case the level set may not be representable, around the point, as 
trace of a curve. 
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U4 


Figure 7.7. Level set L(f,0) for f(x,y) =a*—2?+y? 


Example 7.11 
The level set L(f,0) of 
fey) =a +e 
is shown in Fig. 7.7. The origin is a saddle-like stationary point. On every neigh- 


bourhood of the origin the level set consists of two branches that intersect or- 
thogonally; as such, it cannot be the graph of a function in one of the variables. 


Archetypal examples of how level curves might be employed are the maps used 
in topography, as in Fig. 7.8. On pictures representing a piece of land level curves 
join points on the Earth’s surface at the same height above sea level: such paths 
do not go uphill nor downhill, and in fact they are called isoclines (lit. ‘of equal 
inclination’). 

Another frequent instance is the representation of the temperature of a certain 
region at a given time. The level curves are called isotherms and connect points 
with the same temperature. Similarly for isobars, the level curves on a map joining 
points having identical atmospheric pressure. 


Ya) 


Tig mt 


Figure 7.8. Three-dimensional representation of a territory and the corresponding iso- 
clines 
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7.2.2 Level surfaces 


In presence of three independent variables, level sets can be parametrised around 
regular points. To be precise, if f is C! around a € L(f,c) and Vf(ao) 40, the 
3-dimensional analogue of Corollary 7.3 (that easily descends from Theorem 7.4) 
guarantees the existence of a neighbourhood B(aq) of xo and of a regular, simple 
surface o : R — B(ao) such that 


L(f,c)N B(ao) = {a € R?: ex = o(u,v), with (u,v) € R}, 


hence 
f(o(u,v)) =e V(u,v) ER. (7.10) 


In other terms L(f,c) is locally the image of a level surface of f, and Proposition 7.9 
has an analogue 


Proposition 7.12 At any regular point, the gradient of a function is parallel 


to the normal of the level surface. 


Proof. The partial derivatives of foo at (uo, v0) € R such that o(uo, v9) = Xo 


are 
Oo Oo 
V f (xo) : By (uo vo) = Os Vf (xo) é By (uo vo) = ()., 
The claim follows now from (6.48) O 


Examples 7.13 


i) The level surfaces of f(x, y,z) = 2? + y* + 2? form a family of concentric 
spheres with radii \/c. The normal vectors are aligned with the gradient of f, 
see Fig. 7.9. 


ii) Let us explain how the Implicit Function Theorem, and its corollaries, may 
be used for studying a surface defined by an algebraic equation. Let 1’ C R® be 
the set of points satisfying 

f(x,y, 2) = 2x4 + 2y4 +224 4+2-y-6=0. 
The gradient Vf (x, y, z) = (8a3+1, 8y3—1, 8z3) vanishes only at Py = (— 5, s, 0), 
but the latter does not satisfy the equation, i.e., Py ¢ 5’. Therefore all points 
P of »’ are regular, and »’ is a regular simple surface; around any point P the 
surface can be represented as the graph of a function expressing one variable in 
terms of the other two. 
Notice also 


lim f(x) =+0, 


@L—- Co 


so the open set 


QO=1(ea 2) es fag.) <0}, 
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+e te Ss 


AX 


Figure 7.9. Level surfaces of f(x,y, z) = \/x? + y? + 2? 


containing the origin and of which »’ is the boundary, is bounded; consequently 
+} is a compact set. One could prove »’ has no boundary, making it a closed 
surface. 

Proposition 7.9 proves, for example, that the normal vector (up to sign) to »’ at 
P, = (1,1,1) is proportional to Vf(1, 1,1) = (9,7,8). The unit normal v at Py, 
chosen to point away from the origin (v- a > 0), is then 


i —_—— 


(91 +77 + 8k). 


1 
V¥194 


7.3 Constrained extrema 


With Sect. 5.6 we have learnt how to find extremum points lying inside the do- 
main of a sufficiently regular function. That does not exhaust all possible extrema 
though, because there might be some lying on the domain’s boundary (without 
mentioning some extrema could be non-regular points). At the same time in many 
applications it is required we search for minimum and maximum points on a partic- 
ular subset of the domain; for instance, if the subset is defined through equations or 
inequalities of the independent variable, one speaks about constrained extrema. 

The present section develops two methods to find extrema of the kind just 
described, and for this we start with an example. 

Suppose we want to find the minimum of the linear map f(z) = a-a2, a= 
a V3) € R?, among the unit vectors x of the plane, hence under the constraint 
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||| = 1. Geometrically, the point P = x = (x,y) can only move on the unit 
circle x? + y? = 1. If g(x) = x? +y? —1 = 0 is the equation of the circle and 
G = {a € R? : g(x) = 0} the set of constrained points, we are looking for xo 
satisfying 


roEG and eee) — min fC le 


The problem can be tackled in two different ways, one privileging the analytical 
point of view, the other the geometrical aspects. The first consists in reducing the 
number of variables from two to one, by observing that the constraint is a simple 
closed arc and as such can be written parametrically. Precisely, set y : [0,27] > R?, 
y(t) = (cost, sint), so that G coincides with the trace of y. Then f restricted to 
G becomes f 07, a function of the variable t; moreover, 


i = i t)). 
ie) = sO) 
Now we find the extrema of y(t) = f(y) = cost + V3sint; the map is periodic 
of period 27, so we can think of it as a map on R and ignore extrema that fall 
outside [0,27]. The first derivative y/(t) = — sint + V3 cost vanishes at t = = and 
fn. As y"(Z) = —2 < 0 and y’(S7) = 2 > 0, we have a maximum at t = § 


3 
and a minimum at t = oT. Therefore there is only one solution to the original 
constrained problem, namely ao = (cos an, sin <r) = —(5, =. 


The same problem can be understood from a geometrical perspective relying 
on Fig. 7.10. The gradient of f is V f(a) = a, and we recall f is increasing along 


Figure 7.10. Level curves of f(a) = a- a (left) and restriction to the constraint G 
(right) 
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a (with the greatest rate, actually, see Proposition 5.10); the level curves of f are 
the perpendicular lines to a. Obviously, f restricted to the unit circle will reach 
its minimum and maximum at the points where the level curves touch the circle 


itself. These points are, respectively, xp = —(5, v3), which we already know of, 
and its antipodal point 2; = (5, v3), They may also be characterised as follows. 


The gradient is orthogonal to level curves, and the circle is indeed a level curve for 
g(x); to say therefore that the level curves of f and g are tangent at x;, 1 = 0,1 
tantamounts to requiring that the gradients are parallel; otherwise said at each 
point «x; there is a constant A such that 


V f(a) = AVg(a) . 


These, together with the constraining equation g(a) = 0, allow to find the con- 
strained extrema of f. In fact, we have 


At = 1 
Ay = V3 


substituting the first and second in the third equation gives 


1 
> 
_ v3 
yey 
a = > so A= 2, 


and the solutions are a and a. Since f(a) < f(a1), @o will be the minimum 
point and a, the maximum point for f. 


In the general case, let f : dom f C R” — R be a map in n real variables, and 
G € domf a proper subset of the domain of f, called in the sequel admissible 
set. First of all we introduce the notion of constrained extremum. 


Definition 7.14 A point x9 € G is said a relative extremum point of f 
constrained to G if Xo is a relative extremum for the restriction f\q of f 
to G. In other words, there exists a neighbourhood B,(ao) such that 


Va € B,(ao) NG (ois (ee) 


for a constrained maximum, or 


Va € B(x) NG J(o = Fee) 


in case of a constrained minimum. 
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Constrained absolute maxima and minima are respectively defined by 


f(xo) =max f(x) and — f(ao) =min f(x). 


zrEG xzEG 


A constrained extremum f is not necessarily an extremum as well. The function 
f(x,y) = xy, for example, has a saddle at the origin, but when we restrict to the 
bisectrix of the first and third quadrant (or second and fourth), the origin becomes 
a constrained absolute minimum (maximum). 

A recurring situation is that in which the set G is a subset defined by equations 
or inequalities, called constraints. We begin by examining one constraint and one 
equation, and consider a map g: domg C R” > R, with G C domg, such that 


G = {a € domg: g(x) = 0}; 


in other terms G is the zero set of g, or the level set L(g,0). We discuss both 
approaches mentioned in the foreword for this admissible set. 

Assume henceforth f and g are C! on an open set containing G. The aim is to 
find the constrained extrema of f by looking at the stationary points of suitable 
maps. 


7.3.1 The method of parameters 


This method originates from the possibility of writing the admissible set G in 
parametric form, i.e., as image of a map defined on a subset of R”~!. To be 
precise, suppose we know a C! map w: A C R"~! > R” such that 


G={# ER": x2=7(u) with ue A}. 


In dimensions 2 and 3, this forces G to be the trace of a curve = = 7: I > R? or 
a surface wy = a0 : R — R®. The function w can be typically recovered from the 
equation g(a) = 0; the Implicit Function Theorem 7.5 guarantees this is possible 
provided G consists of regular points for g. Studying f restricted to G is equivalent 
to studying the composite f oa, so we will detect the latter’s extrema on A, which 
has one dimension less than the domain of f. In particular, when n = 2 we will 
consider the map t + f (y(t) of one variable, when n = 3 we will have the 
two-variable map (u,v) + f(o(u,v)). 

Let us then examine the interior points of A first, because if one such, say ug, 
is an extremum for fow, then it is stationary; putting xo = q(uo), we necessarily 
have 


Vulf o)(uo) = Vif (ab(u0)) Jab(u0) = 0. 


Stationary points solving the above equation have to be examined carefully to tell 
whether they are extrema or not. After that, we inspect the boundary OA to find 
other possible extrema. 
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Figure 7.11. The level curves and the admissible set of Example 7.15 


Example 7.15 


Consider f(x,y) = 2? + y—1 and let G be the perimeter of the triangle with 
vertices O = (0,0), A = (1,0), B = (0,1), see Fig. 7.11. We want to find the 
minima and maxima of f on G, which exist by compactness. Parametrise the 
three sides OA, OB, AB by yi(t) = (t,0), yo(t) = (0,t), ya(t) = (¢,1 - 4), 
where 0 < t < 1 for all three. The map fjo4(t) = t? — 1 has minimum at O 
and maximum at A; fiog(t) = ¢—1 has minimum at O and maximum at B, 
and fi4p(t) = t? —t has minimum at C = (5,5) and maximum at A and B. 
Since f(O) = —1, f(A) = f(B) = 0 and f(C) = —i, the function reaches its 


minimum value at O and its maximum at A and B. 


7.3.2 Lagrange multipliers 


The idea behind Lagrange multipliers is a particular feature of any regular con- 
strained extremum point, shown in Fig. 7.12. 


Proposition 7.16 Let xo € G be a regular point for g. If xo is an extremum 
for f constrained to G, there exists a unique constant Ayo € R, called Lag- 
range multiplier, such that 


V f (xo) = AoVg(xo) : ei) 


Proof. For simplicity we assume n = 2 or 3. In the former case what we have seen 
in Sect. 7.2.1 applies to g, since a € G = L(g,0). Thus a regular simple 
curve y : I — R? exists, with ao = (to) for some to interior to I, such 
that o(2) = g (y(t) = 0 on a neighbourhood of 29; additionally, 


Vg9(@0) -¥'(to) =0. 
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Vf 


Vg 


Figure 7.12. At a constrained extremum the gradients of f and g are parallel 


But by assumption t > f (y(t) has an extremum at to, which is stationary 
by Fermat’s Theorem. Consequently 


SF (Y(t) pty = VE(@0) -¥ (to) = 0. 


As V f(ao) and Vg(ao) are both orthogonal to the same vector -y/ (to) 4 0 
(y being regular), they are parallel, i.e., (7.11) holds. The uniqueness of 
Ao follows from Vg(ao) ¥ 0. 

The proof in three dimensions is completely similar, because on the one 
hand g(a(u, v)) = 0 around 2g for a suitable regular and simple surface 
ao : A-» R® (Sect. 7.2.2); on the other the map f(o(u,v)) has a relative 
extremum at (uo, Uo), interior to A and such that o(uo, v9) = Zo. At the 
same time then 


Vg(xo0)Jo(uo,vo) =0, and V f(x0)Ja(uo,vo) = 0. 


As o is regular, the column vectors of Jo(uo, vo) are linearly independent 
and span the tangent plane to o at ao. The vectors Vf (ao) and Vg(ao) 
are both perpendicular to the plane, and so parallel. Oo 


This is the right place to stress that (7.11) can be fulfilled, with g(a) = 0, 
also by a point Z 9 which is not a constrained extremum. That is because the 
two conditions are necessary yet not sufficient for the existence of a constrained 
extremum. For example, f(z,y) =x —y° and g(x,y) =x — y?® satisfy g(0,0) = 0, 
V f(0,0) = Vg(0,0) = (1,0); nevertheless, f restricted to G = {(2,y) € R? : 
a = y®} has neither a minimum nor a maximum at the origin, for it is given by 
yh f(y?,y) =y? —y° (yo = 0 is a horizontal inflection point). 

There is an equivalent formulation for the previous proposition, that associates 
to ap an unconstrained stationary point relative to a new function depending on 
f and g. 
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Definition 7.17 Set 2 =dom fNdomg C R”. The function ££: 2x R—-R 
defined by 


L(a,) = f(a) — Ag(a) 


is said Lagrangian (function) of f constrained to g. 


The gradient of £ looks as follows 


Vie.r)£(#, A) = (VeL(e, d), a (@, )) = (Vf (a) —\V g(a), g(x)) . 


Hence the condition V(z,,)£(#o, Ao) = 0, expressing that (ao, Ao) is stationary for 


L, is equivalent to the system 


ee = \0V9(Xo) , 
g(xo) = 0. 


Proposition 7.16 ensures that each regular extremum point of f constrained by g 
determines a unique stationary point for the Lagrangian L. 


Under this new light, the procedure breaks into the following steps. 


i) Write the system of n+ 1 equations in n + 1 unknowns w = (%,...,2n) 
and A 
{ V f(x) = AVg(x) 
g(x) = 0 


and solve it; as the system is not, in general, linear, there may be a number 
of distinct solutions. 


(7.12) 


For each solution (a9, Ao) found, decide whether ao is a constrained ex- 
tremum for f, often with ad hoc arguments. 

In general, after (7.12) has been solved, the multiplier Xo stops being useful. 
If G is compact for instance, Weierstrass’ Theorem 5.24 guarantees the 
existence of an absolute minimum and maximum of f\g (distinct if f is not 
constant on G); therefore, assuming G only consists of regular points for 
g, these two must be among the ones found previously; comparing values 
will permit us to pin down minima and maxima. 


iii) In presence of non-regular points (stationary) for g in G (or points where 
f and/or g are not differentiable) we are forced to proceed case by case, 
lacking a general procedure. 
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Figure 7.13. The graph of the admissible set G (first octant only) and the stationary 
points of f on G (Example 7.18) 


Example 7.18 


Let us find the points of R° lying on the manifold defined by «+ + y* + z4 = 1 
with smallest and largest distance from the origin (see Fig. 7.13). 

The problem consists in extremising the function f(z, y, z) = ||a||? =2?+y?+2? 
on the set G defined by g(x, y,z) = v*+y*+24—-1= 0. We consider the square 
distance, rather than the distance ||a|| = \/x? + y? + z?, because the two have 
the same extrema, but the former has the advantage of simplifying computations. 
We use Lagrange multipliers, and consider the system (7.12): 


22 = d4a3 
2y = A4y? 
22 = \4z3 


ey ee 10; 
As f is invariant under sign change in its arguments, f(+2,+y,+z) = f(x,y, z), 
and similarly for g, we can just look for solutions belonging in the first octant 
(x > 0, y>0, z > 0). The first three equations are solved by 


1 1 1 
x =0or2 = —, =(0 or y= ——, 2=0o0r2>-——, 
jo * 2 A/D Van 
combined in all possible ways. The point (x,y,z) = (0,0,0) is to be excluded 
because it fails to = the last equation. The choices (= 0, 0), (0, Wore 0) 


=) force zs = 1, hence i. = 3 (since A > 0). Similarly, (x,y,z) = 
(sz, ca (Tr 0, vm) or (0, oat 1) satisfy the fourth equation if A = 
Niort 


Tax) fulfills it if \ = v3 The solutions then are: 
a9), f(x) = V2; 


1 
W 
50; a): f (xs) = V2; 
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|- 


x3 = (0,0,1), f(x3) =1; te = (0, a5, a): f(@e) = v2; 


V2 
@7 = (aga aq) faz) = v3. 
In conclusion, the first octant contains 3 points of G, 71, v2, #3, with the shortest 


distance to the origin, and 1 farthest point #7. The distance function on G has 
stationary points x4, £5, , but no minimum nor maximum. oO 


Now we pass to briefly consider the case of an admissible set G defined by 
m <n equalities, of the type g(a) = 0,1 <i < m. If a € G is a constrained 
extremum for f on G and regular for each g;, the analogue to Proposition 7.16 
tells us there exist m constants Ao;, 1 < 7 < m (the Lagrange multipliers), such 
that 


Vf (xo) = Sy Aoi Vgi(xo) - 
| 
Equivalently, set A = (Ai)i=1,....m) € R™ and g(x) = (gi(@)),_, am then the 
Lagrangian 
L(x, A) = f(x) —A- g(x) (7.13) 


admits a stationary point (ao, Ao). To find such points we have to write the system 
of n+ m equations in the n + m unknowns given by the components of a and A, 


which generalises (7.12). 


Example 7.19 


We want to find the extrema of f(x,y,z) = 3” + 3y + 8z constrained to the 

intersection of two cylinders, 7? + z? = 1 and y? + z? = 1. Define 
gi(2,y,z)=2?+2%-1 and go(z,y,z)=y?+27-1 

so that the admissible set is G = Gi 1 Go, Gi = L(gi,0). Each point of G; is 

regular for g;, so the same is true for G. Moreover G is compact, so f certainly 


has minimum and maximum on it. 
As Vf = (3,3,8), Vgi = (22,0, 2z), Vgo = (0, 2y,2z), system (7.13) reads 


3 = A122 
3 = Andy 
8 = (Ay + Ag) 2z 
a? +27-1=0 


yi +z?—-1=0; 
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the first three equations tell « = OTe y= oer 2= TESTE (with A; 40, Ao £0, 
Ai + A2 # 0), so that the remaining two give 
5 
A= Ag = +5 ; 
Then, setting zo = (2, 2,2) and x = —ao, we have f(a) > 0 and f(a) < 0. 
We conclude that 2p is an absolute constrained maximum point, while a, is an 
absolute constrained minimum. O 


Finally, let us assess the case in which the set G is defined by m inequalities; 
without loss of generality we may assume the constraints are of type g;(x) > 0, 
i = 1,...,m. So first we examine interior points of G, with the techniques of 
Sect. 5.6. Secondly, we look at the boundary OG of G, indeed at points where at 
least one inequality is actually an equality. This generates a constrained extremum 
problem on OG, which we know how to handle. 

Due to the profusion of situations, we just describe a few possibilities using 
examples. 


Examples 7.20 


i) Let us return to Example 7.15, and suppose we want to find the extrema of f 
on the whole triangle G of vertices O, A, B. 

Set gi(z,y) = 2, g(x,y) = y, gs(v,y) = 1-2 —y, so that G = {(z,y) € 
R? : g(x,y) > 0, i = 1,2,3}. There are no extrema on the interior of G, since 
V f(a) = (2x,1) #0 precludes the existence of stationary points. The extrema 
of f on G, which have to be present by the compactness of G, must then belong 
to the boundary, and are those we already know of from Example 7.15. 


ii) We look for the extrema of f of Example 7.18 that belong to G defined by 
a*+y*+24 <1. As Vf(x) = 2a, the only extremum interior to G is the origin, 
where f reaches the absolute minimum. But G is compact, so there must also be 
an absolute maximum somewhere on the boundary OG. The extrema constrained 
to the latter are exactly those of the example, so we conclude that f is maximised 
on G by 27 = (a5: a aR) and at the other seven points obtained from this 
by flipping any sign. 

iii) We determine the extrema of f(x,y) = (a+ ye (@? +9") subject to g(x,y) = 
2x + y > 0. Note immediately that f(x,y) — 0 as ||a|| > co, so f must admit 
absolute maximum and minimum on G. The interior points where the constraint 
holds form the (open) half-space y > —2a. Since 


Vie) = e7 (2 +9") (1 — 2a(a+y),1 —2y(a+ y)) : 


f has stationary points « = a: +); the only one of these inside G is a = 
i ; 

(4,4), for which Hf (ao) = —e71/? ; 3}: The Hessian matrix tells us that 

Zo is a relative maximum for f. On the boundary 0G, where y = —2z, the 


composite map 
2 


v(x) = race —22) = —mpe 
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Figure 7.14. The level curves of f and the admissible set G of Example 7.20 iii) 


admits absolute maximum at 7 = a and minimum at 7 = aT Setting 
n= (> Ta) and 22 = (- a a) we can without doubt say 2, is the 
unique absolute minimum on G. For the absolute maxima, we need to compare 
the values attained at a and a2. But since f(a) = e~!/? and f (a2) = aoe”, 
Zo is the only absolute maximum point on G (Fig. 7.14). 


iv) Consider f(x,y) =x + 2y on the set G defined by 
e+2y+8>0, sx +y+138>0, e—-4y+112>0, 
264 yo = 0, ox —2y—8 <0. 


Figure 7.15. Level curves of f and admissible set G of Example 7.20 iv) 


7.4 Exercises 285 


The set G is the (irregular) pentagon of Fig. 7.15, having vertices A = (0, —4), 
B = (-2,-3), C = (-3,2), D = (1,3), EF = (2,1) and obtained as intersection 
of the five half-planes defined by the above inequalities. As V f(a) = (1,2) 4 0, 
the function f attains minimum and maximum on the boundary of G; and since 
f is linear on the perimeter, any extremum point must be a vertex. Thus it is 
enough to compare the values at the corners 

f restricted to G is smallest at each point of AB and largest at D. This simple 
example is the typical problem dealt with by Linear Programming, a series of 
methods to find extrema of linear maps subject to constraints given by linear 
inequalities. Linear Programming is relevant in many branches of Mathemat- 
ics, like Optimization and Operations Research. The reader should refer to the 
specific literature for further information on the matter. O 


7.4 Exercises 


Supposing you are able to write x°y + xy* = 2 in the form y = v(x), compute 
y’(x). Determine a point xo around which this is feasible. 


The equation x? + y>z = “2 admits the solution (2,1,4). Verify it can be 
written as y = v(x, z), with » defined on a neighbourhood of (2, 4). Compute 
the partial derivatives of ~ at (2,4). 


Check that 


e744 9? —y? —e(x +1) +1=0 


defines a function y = (x) on a neighbourhood of xp = 0. What is the nature 
of the point xo for p? 


Verify that the equation 
og? +22 +e4%+y—222=0 


gives a map y = »(2,z) defined around P = (x0, 20) = (—1,0). Such map 
defines a surface, of which the tangent plane at the point P should be determ- 
ined. 


a) Verify that around Py = (0,2,—1) the equation 
xlog(y + z) + 3(y —2)z+sinz =0 


defines a regular simple surface 3’. 
b) Determine the tangent plane to »’ at Po, and the unit normal forming with 
v = 44+ 29 — 5k an acute angle. 
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Check that the system 


an — cosy +y+t+e* =0 
x—e*—y+z+1=0 


yields, around the origin, a curve in space of equations 
v(t) = (t,¢1(4), a(t), = t € T(0). 


Write the equation of the tangent to the curve at the origin. 


Verify that 
y’ + 3y — 2re?* = 0 


defines a function y = (x) for any x € R. Study y and sketch its graph. 


8. Represent the following maps’ level curves: 


SI Kes ) =6—32 —2y BI) few ) = 4a? + y? 


f(x,y) = 2xy f(x,y) = 2y — 3logz 
; f(x,y) = V/4a + 3y : f(x,y) = 4a — 2y? 


Match the functions below to the graphs A—F of Fig. 7.16 and to the level 
curves I-VI of Fig. 7.17: 


a) z= cos \/x? + 2y? b) z= (x? - yee 


i 
e) z=cosx sin2y f) z=6cos? 2 — —2? 


10 


Describe the level surfaces of the following maps: 
la) | fey, 2) = 2+ yt 5z [b)] f(@.y,2) =a -y? +2? 
Determine the maximum and minimum points of 
f(ey)= 2? 4+ y+ s+ 
on the set G = {(z, y) € R? : 4a? + y? —1=0}. 


Find maximum and minimum points for f(x,y) = x +3y +2 on the compact 
set G = {(z,y) € R? : 2,y >0, 22 +y? <1}. 
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xercise 9 
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Consider the function 
f(x,y) = 2*(y +1) — 2y. 


a) Find its stationary points and describe their type. 
b) Compute the map’s absolute minimum and maximum on the set 


G={(2,y) €R?: V/14+22<y <2}. 


Determine the extrema of f(x,y) = 2x? + y? constrained to 


Gain) EI? 2a? aa +7? = 5 = 01. 


What are the absolute minimum and maximum of f (x,y) = 4x? + y? — 2x — 
4y+1on 
G = {(x,y) € R? : 42? + y? —1=0}? 


7.4.1 Solutions 
1. Set f(x, y) = x? y + xy* — 2; then 
fr(x,y) =30°yty* and fy(z,y) = 2° +42y’, 


So 
4 
1, _ _ 30? (x) + (p(x) 
yg (z) = -—_——_—_ > 
x + 4r(y(z)) 
for any x # 0 with 2? 4 —4y’. 
For instance, the point (0, yo) = (1, 1) solves the equation, thus there is a map 
y = v(x) defined around zp = 1 because f,(1, 1) = 5 4 0. Moreover, y’(1) = —4/5. 


2. Setting f(x,y, z) =27 + y%z—- <2, we have 


fe(a,y,z) =2e—= with f,(2,1,4) =0, 
y 
Ties = Gye gy with. f,(2,1,4) =20 + 0, 
¥y y2 ¥ 
f.(a,y,z) aia with f.(2,1,4)=—140. 


Due to Theorem 7.4 we can solve for y, hence express y as a function of x and z, 
around (2, 4); otherwise said, there exists y = y(z, z) such that 
Ov 1 


—— — — (2.4 = =. 
(2, 4) Ds Az ) 20 
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3. Call f(x,y) = e?-¥ + x? — y? — e(a +1) +1 and notice (0,—1) is a solution. 
Then 

iD) =@° Y -p 29 =e, fy tey) = —e" Y — 2y 
with f,(0,-—1) = 0, f,(0,-—1) = 2—e #0. Theorem 7.1 guarantees the existence 
of a map y = (a), defined around the origin, such that 


C= 


p'(0) = — 


Hence 29 = 0 is a critical point of y. 


4. If we set 
f(a,y,z) = a7 +20 4+ e¥ + y — 223 


then f(—1,0,0) = 0. We have 
fa(@,y, 2) =20+2 with f,(—1,0,0) =0, 
fy(z,y,2) =e" +1 with f,(—1,0,0) =240, 
fe(a,y, z) = —62? with f.(—1,0,0) =0. 

By Theorem 7.4 there is a map y = ¢(, z) around (—1,0) satisfying 


Oy Oy 
<* (-1,0) = 2 (-1,0) 


The tangent plane is therefore 


y= y(—1,0) tr Y2(—1,0)(x + 1) + yz(—1, 0)(z a 0) = 0. 


5. a) The gradient of f(x,y, z) = zlog(y + z) + 3(y — 2)z+sinz is 


Vi (1.4.2) = (logly +2) + cose, —— +32, —— + 3(y—2)), 


y+ 


so Vf(Po) = (1,-3,0) 4 0. By Theorem 7.4 then, we can express x via y and 
z, or y in terms of x, z. Therefore, around Po the surface » is locally a regular 
simple graph. 

b) The tangent plane at Py = 2p is, recalling Proposition 7.9, 


Vf(xo)- (a —x#o) =x —3(y-—2)=0, 


so x — 3y = —6. The unit normal at Po will be vy = y(t — 39), with o € {+1} 


defined by v-v = aa > 0, whence o = —1. 


6. Referring to Theorem 7.5 and Example 7.6, let us set 


ee 
fo(z,y,z) =x2-—e*-—y+2z4+1. 
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Then af af 
1 ae Of, _ 2 
Gy ee nek Ae (t,4,2) =e" 5 
Of Of 
a | ae =a 
Dy (2952) Ae (x,y, 2) 
Now consider the matrix 
Of, Of 
ay (0) Ae (0) - i | 
Ofe Ofe —14/)- 


By 5, (9) 


this being non-singular, it defines around t = 0 a curve +(t) = ( 
For the tangent line we need to compute 7/(t) = (1, y4(t), 9 
ticular (0) = (1, 40), 95(0)). But 


Of, Of, Of 
By 5, (9) y' (0) - Bq (9) 
Of Of ) | ag 
200) “2) 5(0) “2) 
11 (0)\ (3 
Ui ; oleae 


so ; : 
{ v1 (0) + ~5(0) = —3 
(0) — 95(0) =O ° 
solved by y{(0) = v4 (0) = —3/2. In conclusion, the tangent line is 


T(t) =7(0) +7 Ot = (1, -5, -5)e = (7, -$i -51) 


7. The map f(z,y) = y’ + 3y — 2ze*” has derivatives 


ita) = =e" + 32), 1,¢.4)= Wi +3>0. 


Theorem 7.1 says we can have y as a function of x around every point of R?, so 
let y = y(x) be such a map. Then 


y’ (2) _— _ fc(@,y) = 2e°* (1 a 32) ; 
fy(@,y) 7y® +3 
and y’(x) = 0 for x = —-1/3, y’(x) > 0 for x > —1/3. Observe that f(0,0) = 0, so 
(0) = 0. The function passes through the origin, is increasing when x > —1/3, 
decreasing when x < —1/3. The point « = —1/3 is an (absolute) minimum for y, 
with y(—1/3) <0. 
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U4 


wle 


Figure 7.18. The graph of the implicit function of Exercise 7 


To compute the limits as + + og note 


(v(x))’ + 3(y(z)) = 2re3* 


and 
lim 2ze°* =0, lim 2re?* = +00. 
L—>— oo L—+00 


Let €= lim ¢(z), so that 
w—->>—OCoO 


£7 +32 = £68 +3) =0 and hence ¢=0. 


Then call m = lim p(x); necessarily m = +00, for otherwise we would have 
«wL—- +00 


m’ + 3m = +00, a contradiction. 
In summary, 


lim e(2) =0, tim pla) = +00: 
the point x = —1/3 is an absolute minimum and the graph of f can be seen in 


Fig. 7.18. 


8. Level curves: 


a) These are the level curves: 
6— 3x2 -—2y=k i.e., 34+ 2y+k—-6=0. 


They form a family of parallel lines with slope —3/2, see Fig. 7.19, left. 
b) The level curves 


2 2 
der +yr=k ie, +2 =1 
oe ey 


are, for k > 0, a family of ellipses centred at the origin and semi-axes Vk/2, 
Vk. See Fig. 7.19, right. 
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Y y 


Figure 7.19. Level curves of f(x,y) = 6 — 3a — 2y (left), and of f(#,y) = 4x? + y? 
(right) 


¢ 
d 
e 
f 


9. The correct matches are : a-D-IV; b-E-III; c-A-V; d-B-I; e-C-VI,; f-F-II. 


See Fig. 7.20, left. 
See Fig. 7.20, right. 
See Fig. 7.21, left. 
See Fig. 7.21, right. 


eee See ee See 


10. Level surfaces: 


a) We are considering a family of parallel planes of equation 
e+3yt+5z-k=0. 


b) These are hyperboloids (with one or two sheets) with axis on the y-axis. 


y y 
rN A 
k<0O k>0O 
k=0 
= x 
k>0 k<0O 


Figure 7.20. Level curves of f(x,y) = 2axy (left), and of f(x,y) = 2y — 3log x (right) 
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Figure 7.21. Level curves of f(x,y) = 4a + 3y (left), and of f(x,y) = 4a — 2y” (right) 


ll. The set G is an ellipse that we can parametrise as y(t) = (5 cos t, sin t), 
t € [0,27). Thus 


1 3 3 3 
YE) = foxy) = 7 cost + sin’ t + qoost +l = — 7 cos? t + qost + 2, 
with 3 3 3 i 

ei= 5 Sint cost — qsint= 5 Sint (cost — 5) 


Note y’(t) = 0 for sint = 0 or cost = 4, so for ty = 0, te = 7, t3 = $,t4= aT 

Moreover, y'(t) > 0 for t € (0,3) U (x or ), hence y increases on (0,7) an 

(x, 37), while it decreases on (3, m7) and (37, 2r). Therefore P, = y(t1) = (4, - 
1. 


and P, = (tz) = (—4,0) are local minima with values f(P;) = 2, f(P2) = 


the points P; = y(t3) = (4, V3), Peo) =e —¥3) are local Se eh 
(Ps) Hf = 3 . In particular, the maximum value of f on G is 3 reached 
at both P3 and Py. alale the minimum value 1/2 is attained at P». 

An alternative way to solve the exercise is to use Lagrange multipliers. If we 


set g(x, y) = 4x7 + y? — 1, the Lagrangian is 


Ee yr) = f(a, y) = Ag(@,y) : 


Since Vf (x,y) = (24 + 3, 2y), Vg(x, y) = (82, 2y), we have to solve the system 


2c +3 = 8rr 
2y = 2rAy 
Ag? pa? = 1, 


From the second equation we get \ = 1 or y = 0. With A = 1 we find 2 = <, 


y= +3 (the points P3, Py); taking y = 0 gives x = +4 (and we obtain P; and 
P2). Computing f at these points clearly gives the same result of the parameters’ 
method. 
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12. Since Vf(z, y) = (1,3) ¥ (0,0) for all (x,y), there are no critical points on 
the interior of G. We look for extrema on the boundary, which must exist by 
Weierstrass’ Theorem. Now, 0G decomposes in three pieces: two segments, on the 
x- and the y-axis, that join the origin to the points A = (1,0) and B = (0,1), 
and the arc of unit circle connecting A to B. Restricted to the segment OA, 
the function is f(z,0) = «+2, x € [0,1], so x = 0 is a (local) minimum with 
f(0,0) = 2, and x = 1 is a (local) maximum with f(1,0) = 3. Similarly, on OB 
the map is f(0,y) = 3y + 2, y € [0,1], so y = 0 is a (local) minimum, f(0,0) = 2, 
and y = 1 a (local) maximum with f(0,1) = 5. At last, parametrising the arc by 
+(t) = (cost,sint), t € [0, 2 /2] gives 


p(t) = f o-+y(t) =cost+ 3sint+2. 


Since y’(t) = —sint + 3cost is zero at to = arctan3, and y(t) > 0 for t € (0, tol, 
the function f has a (local) maximum at y(to) = (x0, yo). To compute xo, yo 
explicitly, observe that 

{ sinto = 3costo 


sin? to + cos? tpg = 1, 
whence 9 cos” to + cos” to = 9x2 + x2 = 1022 = 1, zo = 1/10, and so yo = 3/V10 


(remember xz, y > 0 on G). Furthermore, f(xo, yo) = 2 + V10. Overall, the origin 
is the absolute minimum and (Zo, yo) the absolute maximum for f. 


13. a) Given that Vf(z,y) = (2x(y + 1), 2? — 2), the stationary points are P; = 
(V2, —1) and P, = (-vV2, —1). The Hessian 


Ay +1) ) 


Af (x,y) = ( es , 


at those points reads 


Af(P1) = (5 “) » Af(P2) = ee ) 


so P,, Pp are both saddle points. 


b) By part a), there are no extrema on the interior of G. But since G is compact, 
Weierstrass’ Theorem ensures there are extremum points, and they belong to 0G. 
The boundary of G is the union of the horizontal segment between A = (—V3, 2) 
and B = (1/3,2) and of the arc joining A to B with equation y = /1 +2? 
(Fig. 7.22). 

On AB we have 


f(x,2) =32?-4, 2x € [-v3, v3] 


whence f has local minimum at « = 0 and local maximum at « = £V3, with 
f (0,2) = —4 and f(+V3, 2) =5 respectively. 
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(V3, 2) 


Figure 7.22. The admissible set G of Exercise 13 


On the arc connecting A and B along y = /1+4+ 2”, we have 


v(x) = f(x, V1 +2?) = 2? (V14ta241)-2V14+22, « €[-V3, V3]. 


ee 3a? + 2/1 + 2? 
vite 
vanishes at x = 0 only, and is positive for x > 0, the point x = 0 is a local minimum 
with f (0,1) = —2, while = +V3 are local maxima. 
Therefore, P; = (0,2) is the absolute minimum, P, = (V3, 0) and P3 = 
(—/3,0) are absolute maxima on G. 


14. Define g(a, y) = «4 — x? + y? — 5 and use Lagrange multipliers. As 
V i (ey) = (44,29), Vgo(x,y) = (4x? — 2a, 2y) , 


we consider the system 
Ag = \2x(227 — 1) 
2y = A2y 
gt — og? t+y?—-5=0. 


The second equation gives \ = 1 or y = 0. In the former case we have x = 0 or 
a = +,/3/2, and correspondingly y = +V5 or y = +V17/2; in the latter case 


a? = ey so 2 > 24/ 142i (note 1— 21 < 0 gives a non-valid x”). Therefore, 


3. V17 jli+v2l 
Pis= (0, +5), P3456 = ( Ji 5) ) ) P73 = (+ 5 .0) 


are extrema, constrained to G. From 


f (P12) =5, f (P3,4,5,6) = = : f(P7s) =14+Vv21 


the maximum value of f is 29/4, the minimum 5. 
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15. Let us use Lagrange’s multipliers putting g(x, y) = 4x? + y? — 1 and observing 
Vif (x,y) = (4v + 2,2y—4), Vole.y) = (8x29). 


We have to solve 


1 
8a +2 = 8dr " 40=1) 
2y —4=2dAy => y=-5—5 
An? +y?-1=0 1 1 4 
Lo=12" f= 


Note A ¥ 1, for otherwise the first two equations would be inconsistent. The third 


| V17 
ae The 


equation on the right gives \-—1=4 hence x = 4 


| 1 = 4 
9 cr; and y = + vi 
extrema are therefore 

1 4 1 4 


Pe =) Oi OB 
. (AF iv 2= ( 2/17 With 
From f(Pi) = 2+ V17, f(P2) = 2-— V17, we see the maximum is 2+ V/17, the 


minimum 2 — 17. 


8 


Integral calculus in several variables 


The definite integral of a function of one real variable allowed us, in Vol. I, to 
define and calculate the area of a sufficiently regular region in the plane. The 
present chapter extends this notion to multivariable maps by discussing multiple 
integrals; in particular, we introduce double integrals for dimension 2 and triple 
integrals for dimension 3. These new tools rely on the notions of a measurable 
subset of R” and the corresponding n-dimensional measure; the latter extends the 
idea of the area of a plane region (n = 2), and the volume of a solid (n = 3) to 
more general situations. 

We continue by introducing methods for computing multiple integrals. Among 
them, dimensional-reduction techniques transform multiple integrals into one- 
dimensional integrals, which can be tackled using the rules the reader is already 
familiar with. On the other hand a variable change in the integration domain can 
produce an expression, as for integrals by substitution, that is computable with 
more ease than the multiple integral. 

In the sequel we shall explain the part played by multiple integrals in the correct 
formulation of physical quantities, like mass, centre of gravity, and moments of 
inertia of a body with given density. 

The material cannot hope to exhaust multivariable integral calculus. Integrat- 
ing a map that depends on n variables over a lower-dimensional manifold gives rise 
to other kinds of integrals, with great applicative importance, such as curvilinear 
integrals, or flux integrals through a surface. These in particular will be carefully 
dealt with in the subsequent chapter, where we will also show how to recover a 
map from its gradient, i.e., find a primitive of sorts, essentially. 

The technical nature of many proofs, that are often adaptations of one- 
dimensional arguments, has induced us to skip them!. The wealth of examples 
we present will in any case illustrate the statements thoroughly. 


' The interested reader may find the proofs in, e.g., the classical textbook by R. Courant 
and F. John, Introduction to Calculus and Analysis, Vol. II, Springer, 1999. 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_7, 
© Springer International Publishing Switzerland 2015 
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Figure 8.1. The cylindroid of a map 


8.1 Double integral over rectangles 


Consider a real function f : B — R, defined on a closed rectangle B = [a,b] x 
[c,d] C R? and bounded over it. We call cylindroid of f the three-dimensional 
region C(f;B) between B and the graph of f 


see Fig. 8.1. (The choice of bounds for z depends on the sign of f(z,y).) If f 
satisfies certain requirements, we may associate to the cylindroid of f a number 
called the double integral of f over B. In case f is positive, this number represents 
the region’s volume. In particular, when the cylindroid is a simple solid (e.g., a 
parallelepiped, a prism, and so on) it gives the usual expression for the volume. 

Many are the ways to construct the double integral of a function; we will explain 
the method due to Riemann, which generalises what we saw in Vol. I, Sects. 9.4 
and 9.5, for dimension 1. 

Let us thus consider arbitrary partitions of [a,b] and [c,d] associated to the 
ordered points {%o,21,...,%p} and {yo,y1,---, Yq} 


C=) 71 = = pat Se HO: C=16 <0 <i 9p =, 
with 
Pp Pp q q 
L=65 =|) peels 2S lea) =|) oe) evel 
h=1 h=1 k=1 k=1 
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ma it tots, 


yop - 


ai 


c= Yo 


Figure 8.2. Subdivision of the rectangle B 


The rectangle B is made of p-q products By, = Ip, x Je; we have thus built a 
partition or subdivision of B, say D= {Bax :h =1,...p,k =1,...,q}, which is 
the product of the partitions of [a,b] and [c,d] (Fig. 8.2). Set 


fring = int fey) ond «= My sup f(y) 
(x,y)EBnk (z,y)EBnr 


and define the lower and upper sum of f on B relative to the subdivision D by 


Mnak(Lh = pia as pa ) 


Mnx(tn — £a—1) (Ye — Yr-1) - 


Since f is bounded on B, there exist constants m and M such that, for each 
subdivision D, 


m(b—a)(d—c) < s(D, f) < S(D, f) < M(b—a)(d—c). 


The following quantities are thus well defined 
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Figure 8.3. Lower (left) and upper sum (right) 


respectively called upper integral and lower integral of f over B. As in the 
one-dimensional case, it is not hard to check that 


fesfr 


Definition 8.1 A map f bounded on B = [a,b] x [c,d] = 1 x J is Riemann 


integrable on B if fan 
ae 
JB B 


This value is called the double integral of f over B, and denoted by one 
of the symbols 


is f, | fla,y)dady, f(x,y) dx dy, ; eee 
B B B J JI cJva 


The geometrical meaning is clear when f is positive on B. Given a partition D 
of B, the cylindroid of f is contained inside the solid formed by the union of the 
parallelepipeds with base By, and height Myx, and it contains the solid made by 
the parallelepipeds with the same base and mpz as height (see Fig. 8.3). 

The upper integral is an over-estimate of the region, while the lower integral an 
under-estimate. Thus f is integrable when these two concide, i.e., when the region 
defines a number representing its volume. 
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Examples 8.2 


i) Suppose f is constant on B, say equal K. Then for any partition D, we have 
Mhk = Mak =i, sO 
q 


s(D, f) = S(D, f) = 92 0 K (an — th-1) (Ye — yr-1) = K (b—a)(d—c). 
k=1h=1 
Therefore 


| f =K(b-a)(d—c) =K -area(B). 
B 
ii) Let f be the two-dimensional Dirichlet function on B = [0,1] x [0, 1] 
lL ieee QO, 0< ay 1, 
f(@,y) = { id 


O otherwise. 
For any given partiton D then, 


h 
This shows f is not integrable on B. 


Remark 8.3 Let us examine, in detail, similarities and differences with the defin- 
ition, as of Vol. I, of a Riemann integrable map on an interval [a,b]. We may sub- 
divide the rectangle B = [a,b] x [c,d] without having to use Cartesian products 
of partitions of [a,b] and [c,d]. For instance, we could subdivide B = U_, B; by 
taking coordinate rectangles B; as in Fig. 8.4. Clearly, the kind of partitions we 
consider are a special case of these. 

Furthermore, if we are given a generic two-dimensional step function y associ- 
ated to such a rectangular partition, it is possible to define its double integral in 
an elementary way, namely: let 


y(z,y)=ca, V(a,y)€By, i=1,...,N; 
then 


N 
| C= S- cyarea(B;) . 
B i=1 


Notice the lower and upper sums of a bounded map f : B > R, defined in (8.1), 
are precisely the integrals of two step functions, one smaller and one larger than f. 
Their constant values on each sub-rectangle coincide respectively with the greatest 
lower bound and least upper bound of f on the sub-rectangle. We could have 
considered the set S; (resp. SF) of step functions that are smaller (larger) than 
f. The lower and upper integrals thus obtained, 


[fret forges) and [r=inttf oso esp, 
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Y 


Sika 
(ee 
8 


Figure 8.4. Generic partition of the rectangle B 


coincide with those introduced in (8.2). In other terms, the two recipes for the 
definite integral give the same result. Oo 


As for one-variable maps, it is imperative to find classes of integrable functions 
and be able to compute their integrals directly, without turning to the definition. 
A partial answer to the first problem is provided by the following theorem. Further 
classes of integrable functions will be considered in subsequent sections. 


Theorem 8.4 If f is continuous on the rectangle B, then it is integrable 


on B. 


Our next result allows us to reduce a double integral, under suitable assump- 
tions, to the computation of two integrals over real intervals. 
Theorem 8.5 Let f be integrable over B = [a,b] x [c,d]. 


a) If, for any y € |c,d|, the integral g(y) = ae y) dx exists, the map 
g: [c,d] > R is integrable on |c,d| and 


[t= [ona= [freer 9 
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b) If, for any x € [a,b], the integral h(x) = fo fle, y) dy exists, the map 
h: [a,b] + R is integrable on |a,b| and 


La = fr) TG — EL Fite) «) di: (8.4) 


In particular, if f is continuous on B, 


[t= [ (fi tenan)ae= [pf semen (8.5) 


Formulas (8.3) and (8.4) are said, generically, reduction formulas for iterated in- 
tegrals. When they hold simultaneously, as happens for continuous maps, we say 
that the order of integration can be swapped in the double integral. 


Examples 8.6 


i) A special case occurs when f has the form f(x,y) = h(x)g(y), with h integrable 
on [a,b] and g integrable on [c,d]. Then it can be proved that f is integrable on 
B = |a,b] x [c,d], and formulas (8.3), (8.4) read 


[t= [ [werar}( [ainay) 


This means the double integral coincides with the product of the two one- 
dimensional integrals of h and g. 


ii) Let us determine the double integral of f(x,y) = cos(x+y) over B = [0, 7] x 
(0, 5]. The map is continuous, so we may indifferently use (8.3) or (8.4). For 
example, 


[ff ( [Veoseon) ey) arm [ [sine ty)" ae 


-|" “(sin(e + ©) - sins) dz = V2-1. 


0 


iii) Let us compute the double integral of f(x, y) = xcos xy over B = [1,2] x [0,7]. 
Although (8.3) and (8.4) are both valid, the latter is more convenient here. In 


fact, 
2 T 
| wcossyazay = | (| veos.ny dy) 
B 1 0 


2 Wee ) 9 
=i | sin.xy| d= | sinazdx = ——. 
1 y=0 1 T 


Formula (8.3) involves more elaborate computations which the reader might want 
to perform. 
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Figure 8.5. The geometrical meaning of integrating on cross-sections 


Remark 8.7 To interpret an iterated integral geometrically, let us assume for sim- 
plicity f is positive and continuous on B, and consider (8.4). Given an %o € [a, 5], 
h(xg) = ihe f (xo, y) dy represents the area of the region obtained as intersection 
between the cylindroid of f and the plane x = xg. The region’s volume is the 
integral from a to b of such area (Fig. 8.5). Oo 


8.2 Double integrals over measurable sets 


For one-dimensional integrals the region over which to integrate is always an in- 
terval, or a finite union of intervals. Since double-integrating only over rectangles 
(or finite unions thereof) is too restrictive, we need to introduce the kind of sets 
over which we will discuss integrability. To this end, let 2 be an arbitrary bounded 
subset of R?, and denote by yg : R? > R its characteristic function 


@)= 4) ifeeN, 
ae 0 ifad” 


(see Fig. 8.6). 

We fix an arbitrary rectangle B containing (2, and ask ourselves whether yqQ 
is integrable on B. It is easy to check that if yes, and if B’ is any other rectangle 
containing (2, the function yq is still integrable on B’. Moreover the two integrals 
Jz Xe and J,,,x@ coincide, so the common value can be denoted by [,, xq. That 
said, let us introduce the notion of a measurable set a la Peano-Jordan. 


Definition 8.8 A bounded subset Q C R? is measurable if, for an arbitrary 
rectangle B containing 92, the function vq is integrable on B. If so, the non- 
negative number 


=f x0 
Q 


is the measure (or area) of 2. 
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z 
A 


Figure 8.6. The characteristic function of a set 2 


Examples of measurable sets include polygons and discs, and the above notion 
is precisely what we already know as their surface area. At times we shall write 
area({2) instead of ||. In particular, for any rectangle B = [a,b] x [c,d] we see 
immediately that 


B|=area(B) = f away = ff Pita <, 


Not all bounded sets in the plane are measurable. Think of the points in the square 
(0, 1] x [0, 1] with rational coordinates. The characteristic function of this set is the 
Dirichlet function relative to Example 8.2 ii), which is not integrable. Therefore 
the set is not measurable. 


Another definition will turn out useful in the sequel. 


Definition 8.9 A set 92 has zero measure if it is measurable and |Q2| = 0. 


With this we can characterise measurable sets in the plane. In fact, the next 
result is often used the tell whether a given set is measurable or not. 


Theorem 8.10 A bounded set Q C R? is measurable if and only if its bound- 


ary OL2 has measure zero. 


Among zero-measure sets are: 
i) subsets of sets with zero measure; 
ii) finite unions of sets with zero measure; 


iii) sets consisting of finitely many points; 
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iv) segments; 


v) graphs [= {(a, #@))} ori = {(f(y), y)} of integrable maps f : [a,b] > R; 
vi) traces of (piecewise-)regular plane curves. 


As for the last, we should not confuse the measure of the trace of a plane curve 
with its length, which is in essence a one-dimensional measure for sets that are 
not contained in a line. 

A case of zero-measure set of particular relevance is the graph of a continuous 
map defined on a closed and bounded interval. Therefore, measurable sets include 
bounded sets whose boundary is a finite union of such graphs, or more gener- 
ally, a finite union of regular Jordan arcs. At last, bounded and convex sets are 
measurable. 


Measurable sets and their measures enjoy some properties. 


Property 8.11 Jf 2) and 22 are measurable, 


i) 221 Cc Qo implies |21| < |2|; 


ti) 2) Ug and 23N Q2 are measurable with 


|21 U 23| = [21 | + |25| — |921 ial 23| , hence | U 22| < |21| + |29| ; 


Here is a simple, yet useful result for the sequel. 


° _ 
Property 8.12 Jf is a measurable set, so are the interior 22, the closure 22, 


and in general all sets Q such that QC QC 2. Any of these has measure |Q2|. 


Proof. All sets 2 have the same boundary 02 = 02. Therefore Theorem 8.10 
tells they are measurable. But they differ from one another by subsets of 
O92, whose measure is zero. Oo 


Now we introduce the notion of integrability for bounded maps on a given 
measurable set §2. The method is completely similar to the way we selected meas- 
urable sets. Let f : 2 — R be a bounded map, and consider f : R? > R (called 
the trivial extension of f to R?), defined by 


wo f(x) iffweQ, 
je)={" ifa éQ 


(8.6) 


(see Fig. 8.7). 
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> 


—— 


B 


Figure 8.7. Trivial extension of a map to a rectangle B 


Definition 8.13 The map f is (Riemann) integrable on 02 if f is integrable 


on any rectangle B containing 22. In that case, the integral | f is independent 


B 
i, / 
22 B 


This value is called the double integral of f over 9. Other symbols to 


denote it are 
[fs [ flenaeay, [ fae. 


Observe that if f coincides with the characteristic function of 2, what we have 
just defined is exactly the measure of £2. 

The first large class of integrable maps we wish to describe is a sort of gener- 
alisation of piecewise-continuous functions in one variable. 


Definition 8.14 A map f : 2 —R, bounded on a measurable set 2, is said 


generically continuous on {2 if the discontinuity set has zero measure. 


A bounded, continuous map on 2 is clearly generically continuous, as no dis- 
continuity points are present. An example of a generically continuous, but not 
continuous, function is given by the map f(x,y) = sign(x — y) on the square 
2 = (0,1). 

Then we have the following result. 
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Theorem 8.15 Let f be generically continuous on a measurable set 2. Then 


f is integrable on 92. 


There is a way to compute the double integral on special regions in the plane, 
which requires a definition. 


Definition 8.16 A set Q C R? is normal with respect to the y-axis if 
it is of the form 
Qe gem -an2<b, gia) <y< gG)} 


with gi, g2: [a,b] > R continuous. 
Analogously, 2 is normal with respect to the x-axis if 


Q = {(2,y) € R?:e<y<d, hily) <y < haly)} 


with hy, hg: [c,d] > R continuous. We will shorten this by the terms normal 
for y and normal for «x respectively. 


This notion can be understood geometrically. The set (2 is normal with respect to 
the y-axis if any vertical line x = xo either does not meet , or it intersects it in the 
segment (possibly reduced to a point) with end points (xo, gi(xo)), (xo, g2(x0o)). 
Similarly, (2 is normal for x if a horizontal line y = yo has intersection either empty 
or the segment between (hi (yo), yo), (h2(yo), yo) with 2. Examples are shown in 
Fig. 8.8 and Fig. 8.9. 

Normal sets for either variable are clearly measurable, because their boundary 
is a finite union of zero-measure sets (graphs of continuous maps and segments). 
The next proposition allows us to compute a double integral by iteration, thus 
generalising Theorem 8.5 for rectangles. 


Figure 8.8. Normal sets for y 
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Figure 8.9. Normal sets for x 


Theorem 8.17 Let f : 2 — R be continuous on 2. If Q is normal for y, 


then 
(La =[ el xy) 7) aby, (8.7) 


If Q is normal for x, 


Proof. Suppose {2 is normal for x. From the definition, 92 is a closed and bounded, 
hence compact, subset of R?. By Weierstrass’ Theorem 5.24, f is bounded 
on 92. Then Theorem 8.15 implies f is integrable on 2. As for formula (8.8), 
let B = [a,b] x [c,d] be a rectangle containing 92, and consider the trivial 
extension f of f to B. The idea is to use Theorem 8.5 a); for this, we 
observe that for any y € [c,d], the integral 


= [ Fewes 


exists because « ++ f(z,y) has not more than two discontinuity points 
(see Fig. 8.10). Moreover 


Fe wae = OO esi, 
hi(y) 


fr-feef [[ Fenes) w= [ (fo seanae) dy, 


proving (8.8). Formula (8.7) is similar. Oo 
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Figure 8.10. Reduction formula for an iterated integral 


Also (8.7) and (8.8) are iterated integrals, respectively referred to as formula of 
vertical and of horizontal integration. 


Examples 8.18 


i) Compute 
| (x + 2y)dady, 
Q 


where {2 is the region in the first quadrant bounded by the curves y = 22, 
y = 3-27, x =0 (as in Fig. 8.11, left). The first line and the parabola meet at 
two points with « = —3 and x = 1, the latter of which delimits 9 on the right. 
The set (2 is normal for both x and y, but given its shape we prefer to integrate 
in y first. In fact, 


Q = {(z,y) € R?:0<2<1,27<y<3-27}. 


Hence, 


i 3-07 1 j=3—3? 
| (a + 2y) dxdy = | | (a + 2y) dy | dx = | ley + | dx 
Q 0 Qn 0 y=2xu 


1 
= (x* — x — 12x? + 3x + 9) dx 
0 


xr=1 
a oe 3,3 9 129 
= |-x2° — -2* -—4 = 9 =, 
E hed 5 ls av + 92 - 20 
Had we chosen to integrate in x first, we would have had to write the domain as 


Q = {(z,y) ER? :0<y<3,0<2<holy)}, 


where ha(y) is 
1 2 
3-—y if2<y<3. 


The form of h2(y) clearly suggests why the former technique is to be preferred. 
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Figure 8.11. The set 2 = {(z,y) € R?:0< a < 1,22 < y < 3— 2°} (left) and 
Q={(z,y) ER? :0<y<4, Vy¥< ez < 4} (right) 


ii) Consider 
p (5y + 2a) dx dy 
Q 


where {2 is the plane region bounded by y = 0, y = 4, x = 4 and by the graph of 
x = ,/y (see Fig. 8.11, right). Again, 2 is normal for both x and y, but horizontal 
integration will turn out to be better. In fact, we write 


Q= {(z,y) ER? :0<y<4, Vy<a<a4l}; 


therefore 


4[ pA 4 a 
| (5y + 22) da dy = | / (5y + 2x) dx | dy = | [5icy + Z| dy 
2 0 vy 0 c= Jy 


4 
= / (19 1 16= 5y°/?) dy 
0 
1 yaa 
= ap liga) = 152. 
D - 


For positive functions that are integrable on generic measurable sets 2, e.g., 
rectangles, the double integral makes geometrical sense, namely it represents the 
volume vol(C) of the cylindroid of f 


See Fig. 8.12. 
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Figure 8.12. The cylindroid of a positive map 


Example 8.19 


Compute the volume of 
C= {(2,y,2) €R?:0<y< a, oy? <25,2< ay}, 
The solid is the cylindroid of f(x,y) = xy with base 
a { (2,9) ER?:0<y< a, a? +y? < 25}. 


The map is integrable (as continuous) on 2, which is normal for both x and y 
(Fig. 8.13). Therefore vol(C) = [,, ry da dy. 


Figure 8.13. The set Q = { (2,4) eR*:0<y< 53, fy 2 25 — 7} 
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The region {2 lies in the first quadrant, is bounded by the line y = 3x and the 
circle 2? + y? = 25 centred at the origin with radius 5. For simplicity let us 
integrate horizontally, since 


4 
2 ={(a,y) ER :0<y <3, sy <a< V25—y7h. 


Thus 
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8.2.1 Properties of double integrals 


In this section we state a bunch of useful properties of double integrals. 


Theorem 8.20 Let f, g be integrable maps on a measurable set QC R?. 


i) (Linearity) For any a,8 €R, the map af + 8g is integrable on Q and 


[ofrto=of t+8 f 9. 


ii) (Positivity) If f >0 on Q, then 


[20 


In addition, if f is continuous and Q measurable with |Q| > 0, we have 
equality above if and only if f is identically zero. 


iii) (Comparison/Monotonicity) If f < g on 2, then 


fos Ia 


iv) (Boundedness) The map |f| is integrable on Q and 


als fn. 
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v) (Mean Value Theorem) /f 2 is measurable and 


m= inf Cee Uh sup Fe, Y) 5 
(x,y)EQ (z,y)EQ 


7 
Vo oy Sel 
|2| Sa 


1 
= a f (8.9) 


The number 


is called (integral) mean value of f on (2. 


vi) (Additivity of domains) Let 2 = 2Q)UN2, with 213N 22 of zero measure. 
If f is integrable on 92, and on §22, then f is integrable on 92, and 


ee dng 


ott) If f = g except that on a zero-measure subset of 2, then 


Li bp 


Property vi) is extremely useful when integrals are defined over unions of finitely 
many normal sets for one variable. 


There is a counterpart to Property 8.12 for integrable functions. 


Property 8.21 Let f be integrable on a measurable set 2, and suppose tt is 
defined on the closure 22 of 22. Then f is integrable over any subset (2 such 
° 


fife 


Otherwise said, the double integral of an integrable map does not depend on 
whether bits of the boundary belong to the integration domain. 


that QC QCM, and 


Examples 8.22 


i) Consider 


[a+saray, 
Q 


where 2 = {(x,y) € R?: y > |x|, y < $x +2}. 
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] Z 


y= 5042 


> > O: y = |z| 
| 


Figure 8.14. The set 2 relative to Example 8.22 i) 


The domain $2, depicted in Fig. 8.14, is made of the points lying between the 
graphs of y = |z| and y = $2+2. The set 2 is normal for both z and y; it is more 
convenient to start integrating in y. Due to the presence of y = |x|, moreover, it 
is better to compute the integral on (2 as sum of integrals over the subsets 2; 
and {22 of the picture. The graphs of y = |a| and y = 42 +2 meet for x = 4 and 
c= —4. Then $2; and {22 are, respectively, 

Oy = {(my) CR? 0< a <da<y<5e+3} 


and 


4 1 
Nz = {(x,y) € Rio <a <0, TE <y < 5et 2}. 
Therefore 


4 Lo+2 4 , 
| (1+ 2) dedy = [ ‘i (1 +2) dy de = | [ytay]2°*? de 
21 0 x 0 
ae uel 3 1 3 a: 
_ te, 4 9 me eat ae ee 
[ (eget get dz ae ea 3 
and 


fata dedy =f ( 
( 


0 0 
1 7 20 
2 3 2 

= _ _ 2 d = - or 2 => =. 

[. pr taut ) a 50 + gn" + 7 57 

In conclusion, 

28 20 272 

fore ardy = [ (1+ 2) dry + [ (+2) drdy=—+—=-—. 

Q on ee 3 27 27 


316 8 Integral calculus in several variables 


Figure 8.15. The set 2 relative to Example 8.22 ii) 


ii) Consider 


Y 
dad 
lea ee 


with Q bounded by the circles x? + y? = 25, x? + y? — 2x = 0 and the line 
y= $x (Fig. 8.15). The first curve has centre in the origin and radius 5, the 
second in (22,0) and radius #2. They meet at P = (4,3) in the first quadrant; 
moreover, the line y = 4 intersects x? + y? = 25 in Q = (2V5, 5). The region 
is normal with respect to both axes; we therefore integrate by dividing (2 in two 
parts 2,, Q2, whose horizontal projections on the x-axis are [0,4] and [4,25]: 


Yy y y 
dz dy = dz d dx d 
Lon me Lan ° Ee ss va 


4 af 222-2? 2/5 J25—x2 
Y Y 
i / dy act | / dy | dx 
0 x/2 a+ 4 x/2 el 


Oy gs 2) oS 
~2)/, @+1 8), etl 


io 30 1 2v® 95 
=< / (-5r+30- arts | (—5e+5+—~—)de 
0 c+ 8 4 


5 
= g (10 + 2V5 — 25log5 + 19log(1 + 2V5)) . 
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8.3 Changing variables in double integrals 


This section deals with the analogue of the substitution method for one-dimensional 
integrals. This generalisation represents an important computational tool, besides 
furnishing an alternative way to understand the Jacobian determinant of plane 
transformations. 

Consider a measurable region 2 C R? and a continuous map f that is bounded 
on it. As f is integrable on 2, we may ask how /[, q/ varies if we change variables 
in the plane. 

Precisely, retaining the notation of Sect. 6.6, let 2’ be a measurable region 
and ® : 2’ + 2, (x,y) = ®(u,v) a variable change, like in Definition 6.30. We 
want to write the integral of f(x,y) over 2 as integral over 9’ by means of the 
composite map f = fo®, ie., f(u,v) = f(®(u,v)), defined on Q’. To do that 
though, we ought to recall the integral is defined using the areas of elementary sets, 
such as rectangles, into which the domain is divided. It becomes thus relevant to 
understand how areas change when passing from 2’ to 2 via ®. 

So let us start with a rectangle B’ in 92’, whose sides are parallel to the axes 
u and v and whose vertices are Ugo, Uy = Uo + Auel, Ug = Uo + Aveg and 
U3 = Uo + Aue, + Av eg (Fig. 8.16, left); here Au,Av denote positive, and small, 
increments. The area of B’ is |B’| = AuAv, clearly. 

Denote by B the image of B’ under ® (Fig. 8.16, right); it has sides along the 
coordinate lines of ® and vertices given by x; = ®(u;), 7 = 0,...,3. In order to 
express the area of B in terms of B’ we need to make some approximations, justified 
by the choice of sufficiently small Au and Av. First of all, let us approximate B 
with the parallelogram B? with three vertices at ao, 21, Zo. It is known that its 
area is the absolute value of the cross product of any two adjacent sides: 


|B] ~ |BP| = ||(a1 — wo) A (a2 — 20). 


On the other hand, the Taylor expansion of first order at uo gives, also using (6.25), 


vA ys 
“2 
U2 U3 
~ T) Av 
B” 
Av B’ Lo 
,. : 
~v TAU £1 

uo Au U1 
al > 
uU xv 


Figure 8.16. How a variable change transforms a rectangle 
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O® 
1 — &o = B(u1) — B(uo) ~ F (uo) Au = 1 Au, 
U 


and similarly 


L2— Xo ~ T2Av, with T2 = —(uo). 


Ov 
Therefore 
|B) ~ ||71 A T2|| AuAv = ||7, A 72|| |B’. 
Now (6.27) yields 
[71 A T2|| = | det JB(uo)| , 

so that 

|B| ~ | det J@(up)| |B’. (8.10) 
Up to infinitesimals of order greater than Au and Av then, the quantity | det J®| 


represents the ratio between the areas of two (small) surface elements B’ Cc 2’, B.C 
9, which correspond under ®. This relationship is usually written symbolically as 


dx dy = | det J@®(u, v)|dudv, (8.11) 


where du dv is the area of the ‘infinitesimal’ surface element in (2/ and da dy the 
area of the image in 2. 


Examples 8.23 


i) Call ® the affine transformation ®(u) = Au + b, where A is a non-singular 
matrix. Then B is a parallelogram of vertices #9, £1, Z2, 3, and concides with 
B?”, so all approximations are actually exact. In particular, the position vectors 
along the sides at xo read 
L1— Xp =a Au, LQ —- Xo = anAv, 

where a, and ag are the column vectors of A. Hence 

|B| = |lay A ag|| AuAv = | det A|AuAv = | det J®| |B’), 
and (8.10) is an equality. 


ii) Consider the transformation ® : (7,8) + (rcos@,rsin@) relative to polar 
coordinates in the plane. The image under ® of a rectangle B’ = [ro, ro + Ar] x 
(00, 99 +A] in the r-plane is the region of Fig. 8.17. It has height Ar in the radial 
direction and base ~ rAé@ along the angle direction; but since polar coordinates 
are orthogonal, 


|B] ~ rArAé = r|B’|. 
This is the form that (8.10) takes in the case at hand, because indeed r = 
det J@®(r,0 ) > 0 (recall (6.29)). 


After this detour we are ready to integrate. If D’ = {Bi}ier is a partition of 
92’ into rectangular elements, D = {B; = ®(Bi)}ier will be a partition of Q in 
rectangular-like regions; denote by | det J®;| the value appearing in (8.10) for the 
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‘ a 


> 


TO ro tAr 7 


ay 


Figure 8.17. Area element in polar coordinates 


element B;. If f; is an approximation of f on B; (consequently, an approximation 
of the transform f = f o ® on Bi as well), then by using (8.10) on each element 
we obtain 

iel iel 
Now refining the partition of 9’ further and further, by taking rectangles of 
sides Au, Av going to 0, one can prove that the sums on the right converge 
to fa, f(®(u, v))| det J&(u, v)| dudv, while those on the left to fi, f(x,y) dx dy, 


provided f is continuous and bounded on 2. 
All the above discussion should justify, though heuristically, the following key 
result (clearly, everything can be made rigorous). 


Theorem 8.24 Let ® : 2’ > 22 be a change of variables between measurable 
regions 2’, Q in R?, as in Definition 6.30, with components 


x = plu, v) y= v(u,»). 


If f is a continuous and bounded map on 92, 


[ f@wezay = | Aur dead aldud. (9 
. 


In the applications, it may be more convenient to use the inverse transformation 
(u,v) = U(a,y), such that 2’ = G(Q) and det J&(u,v) = (det JU(a,y)) 


Let us make this formula explicit by considering two special changes of vari- 
ables, namely an affine map and the passage to polar coordinates. 


; aii 412 
For affine transformations, set x = ®(u) = Au+ b where A = ( ) 
a21 22 
b 
and b = ee: hence x = y(u,v) = ai1u+ aigu + bi and y = (u,v) = agaiut 
2 
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a22u + bg. If 2 is measurable and 2! = &~!(2), we have 


| flew) azay =| acral f f(ai1u + aigu + 61, agiu + aggv + bz) dudv. 
Q Qr 


As for polar coordinates, x = y(r,0) = rcosé, y = w(r,0) = rsiné, and 
det J®(r,0) =r. Therefore, setting again 2! = ®~'(Q), we obtain 


| ji pdedy — f(rcos6,rsin 6) rdr dé. 
Q 


QQ’ 


Examples 8.25 
i) The map 


f(a, y) = (a? — y’) log (1+ (@ +9)*) 
is defined on 2 = {(z,y) € R?: x >0,0<y< 2-2}, see Fig. 8.18, right. The 
map is continuous and bounded on Q, so [, f(x,y) dx dy exists. The expression 
of f suggests defining 
u=a2t+y and v=2— 7, 
which entails we are considering the linear map 


A y= du) = 4 


£= pi) = 


i>. 17) 


defined by A = 
eae & 1/2 


) with | det A] = 1/2. The pre-image of 22 under ® 


is the set 
QM ={(u,v) €R?:0<u<2, -u<v<u} 
of Fig. 8.18, left. 


y 
2 
aA 2 
2 


“ou | N« 


Figure 8.18. The sets 2 = {(z,y) € R?: « > 0,0 < y < 2-2} (right) and 2’ = 
{(u,v) ER? :0<u<2, -u<vu <u} (left) 
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Then 


1 
[Woe (1+ e+) drdy = 5 f wetog(t + ut) dudy 
22 


/ 


1 2 U 
| (/ vdv) ulog(1 + u*) du 
0 —u 


1 2 v=u 
= if }»?| ulog(1+u*)du=0. 
4 (0) v=—U 
ii) Consider 
i 
Mew Tae 


over 


Q={(2,y) €R?:0<y< V32,1<a7+y? <4}. 
Passing to polar coordinates gives 


@ ={ir0)cl<7<2,0<0< =} 
(Fig. 8.19), so 


1 r/3 2 ie 
d d = = —— 
[few x dy Loree / (| ie ar) dé 


0 Yy | 

2 Ow i 

3 ys | 
| 0 

| 

| | 

| | 

! ! 
1 2 # - 1 2 


Figure 8.19. The sets 2 = {(z,y) € R?:0<y < V32,1< 27+ y? < 4} (right) and 
2 ={(r,0):1<r<2,0<6< } (left) 
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8.4 Multiple integrals 


Since the definition of multiple integral — or n-dimensional integral — for a map 
of n > 3 real variables closely resembles the one just seen for 2 variables, we 
merely point out the differences with the previous situations due to the increased 
dimension in the case n = 3. 

The role of plane rectangles is now taken by parallelepipeds in space. The 
generic set B = |[a,, bi] x [ag,b2] x [a3,b3] can be broken up into the product of 
partitions of the intervals [a1, bi], [a2, ba], [a3, 63], using the points {xo,71,...,2p}, 
{Yo, Y1,-++,Yq} and {Zo, 21,.-., Zr} respectively. The solid B is thus the union of 
parallelepipeds 

Bure = [©n—1, En] X [yr—1, Ye] X [Ze-1, 2e]- 


Take f : B — R bounded; the lower and upper sums of f over B relative to the 
above partition D are, by definition, 


where 


Make = inf Fay: Zz); Mnke = sup Ie YZ) 
(v,y,2)€ Bare (x,y,z)€Bnre 


A map f is said integrable on B if 


int S(D. 7 )— sups( ), 7}: 
D D 


such value, called the (triple) integral of f on B, is denoted by one of the 


symbols 
Le Ihe [ flene)deayae, 


Here, as well, continuity guarantees integrability. One also has the analogue of 
Theorem 8.5. 


Example 8.26 
Compute [, yz dx dy dz over B = [0,1] x [—1,2] x [0,2]. We have 


°(f) (f ereas) ay) a 


| syzdzedydz= 
B 


I| | 
os, 
a 
c= 
He wo 

Neng 

Xx 

Qo. 

Neng 
NN” 

eo. 

x 

I| 
| co 
o—s 

wo 

x 

Qo. 

XR 

I| 
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To be able to integrate over bounded sets 2 C R°, it is necessary to define 
what ‘measurable’ means in space. In analogy to Definition 8.8, we substitute the 
rectangle B with a parallelepiped containing 2, and put 


l= f xo= | xa. 
Q B 


This is independent of the choice of B. The familiar sets of Euclidean geometry, 
like spheres, cylinders, polyhedra and so on, turn out to be all measurable. This 
notion of measure is indeed the volume, so we shall sometimes write |Q2| = vol({2). 
Definition 8.9, Theorem 8.10 and Property 8.12 hold also in higher dimensions. 
For instance, if D C R? is measurable and g : D — R integrable on D, the graph 
has 3-dimensional measure equal zero. In particular, the traces of regular surfaces 
have, as subsets of R?, zero measure. 


Take f : 22 — R bounded, with §2 measurable. Given an arbitrary paral- 
lelepiped B containing 2, we extend f by setting it to zero on B\ 2, and call the 
extension a One says f is integrable on 7 if Fi is integrable on B, in which case 


[raph 


this number, not depending on B, is the (triple) integral of f on 92, also denoted 


' Ihe [ flewe)dedyae, [ fae. 


Again, Definition 8.14 and Theorem 8.15 adapt to the present situation, and sim- 
ilarly happens for Theorem 8.20. 

A completely analogous construction leads to multiple integrals in dimen- 
sion n > 3. 


We would like to find examples of regions in space where integrals are easy 
to compute explicitly. These should work as normal sets did in the plane, i.e., 
reducing triple integrals to lower-dimensional ones. We consider in detail normal 
regions for the z-axis, the other cases being completely similar. 


Definition 8.27 We call 2 C R® anormal set for z (normal with respect 
to the z-axis, to be precise) if 


QO {aw,2) eR Way) Dygia.g) = 2 a, a), (8.13) 


for a closed measurable region D in R? and continuous maps g1, g2: D> R. 


Fig. 8.20 shows a few normal regions. 
The boundary of a normal set has thus zero measure, hence a normal set is 
measurable. Triple integrals over normal domains can be reduced to iterated double 
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Figure 8.20. Normal sets 


and/or simple integrals. For clarity, we consider normal domains for the vertical 
axis, and let (2 be defined as in (8.13); then we have an iterated integral 


g2(x,y) 
ja =| (es le Uae) | da dy , (8.14) 


see Fig. 8.21, where f is integrated first along vertical segments in the domain. 


Dimensional reduction is possible for other kinds of measurable sets. Suppose 
the measurable set 2 C R? is such that the z-coordinate of its points varies within 
an interval [a,3] C R. For any Zo in this interval, the plane z = z cuts 92 along 
the cross-section 2, (Fig. 8.22); so let us denote by 


Ax a {(x,Yy) iS R? : (x, Y, 20) ‘= 2} 


Figure 8.21. Integration along segments 
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Figure 8.22. Integration over cross-sections 


the xy-projection of the slice 2,,. Thus 2 reads 


Now assume every A, is measurable in R?. This is the case, for example, if the 
boundary of {2 is a finite union of graphs, or if 2 is convex. 

Taking a continuous and bounded function f : 2 + R guarantees all integrals 
ve f(x,y, z) da dy, z € [a,8], exist, and one could prove that 


fs-f[ (/. f(e.4,2) de dy) dz ; (8.15) 


this is yet another iterated integral, where now f is integrated first over two- 
dimensional cross-sections of the domain. 


Examples 8.28 
i) Compute 


| x dx dy dz 
Q 


where {2 denotes the tetrahedron enclosed by the planes x = 0, y = 0, z = 0 and 
x+y+z=1 (Fig. 8.23). The solid is a normal region for z, because 


2 ={(z,y,z) € R?: (z,y) € D,0O<z<1-—2-y} 


where 

D={(a,y) €R?:0<2<1,0<y<1-2}. 
At the same time we can describe {2 as 

2 ={(2,y,z) €R®? :0<2z <1, (2, y) € Az} 
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z 


Figure 8.23. The tetrahedron relative to Example 8.28 i) 


with 


Az ={(2,y) €R? :0<2¢<1-2,0<y<1-2-2}. 
See Fig. 8.24 for a picture of D and A,. Then we can integrate in z first, so that 


l—x-y 
[ eavayae = | (| raz) dedy [ (1-2 ~y)ndrdy 
Q Dp \Jo D 
1 1-2 1 ri =l-2x 
=) (| (1 —2~y)edy) de = -5 f (a-«-y)] eae 
0 0 2 0 y=0 


a i i 
==] (l—2)adz = —. 
ak ea we cla Tr 


Figure 8.24. The sets D (left) and Az (right) relative to Example 8.28 i) 
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But we can also compute the double integral in dz dy first, 


[ varayae = [ (/, earav) a= ([ (a) ar) dz 


ii) Consider 
| Vy? + 22 dr dy dz, 
Q 


where 2 is bounded by the paraboloid x = y?+ 2? and the plane x = 2 (Fig. 8.25, 
left). We may write 


2 = {(@,4,2) € Resi 429 (y,z) € Ay} 
with slices 
Ag = {(y,z) € R?:y? +27 <a}. 
See Fig. 8.25, right, for the latter. We can thus integrate over A, first, and the 


best option is to use polar coordinates in the plane yz, given the shape of the 
region and the function involved. Then 


Qn VE D) 
/ VP FPayde = | i r? dr | dO = —anaV/z, 
As 0 0 3 
and consequently 


ae ie 1 
| VP FP dcdyde = 3x | inte ae. 
e ae 15 


z 


an 


Figure 8.25. Example 8.28 ii): paraboloid (left) and the set A, (right) 
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Notice the region 9 is normal for any coordinate, so the integral may be com- 
puted by reducing in different ways; the most natural choice is to look at (2 as 
being normal for x: 

Q = {(x,y,z) €R?: (y,z) €D, yt22 <2 <2} 
with D = {(y,z) €R?:0<a2<y?+4+27}. The details are left to the reader. 


8.4.1 Changing variables in triple integrals 


Theorem 8.24, governing variable changes, now reads as follows. 


Theorem 8.29 Let ® : 2’ > 2, with Q', Q measurable in R°, be a change 
of variables on 2, and set x = ®(u). If f is a continuous and bounded map 
on 92, we have 


| Heo | f((u)) | det JO(w)| aa’. (8.16) 
. 


The same formula holds in any dimension n > 3. 
Let us see how a triple integral transforms if we use cylindrical or spherical 
coordinates. 


Let ® define cylindrical coordinates in R*, see Sect. 6.6; Fig. 8.26, left, shows 
the corresponding volume element. If Q is a measurable set and 2! = ®~!(Q), 
from formula (6.41) we have 


| J tee ae dy de — f(rcos0,rsin 6, t) rdr dé dt. 
Q Q! 
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Figure 8.26. Volume element in cylindrical coordinates (left), and in spherical coordin- 
ates (right) 
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The variables x,y and z can be exchanged according to the need. Example 8.28 
ii) indeed used the cylindrical transformation ®(r, 6,t) = (t,rcos@,rsin@). 


Consider the transformation ® defining spherical coordinates in R*; the volume 
element is shown in Fig. 8.26, right. If Q’ and are related by Q’ = ®~1(Q), 
equation (6.44) gives 


i} ie go. ide dude — f(rsinycos6,rsinysin§,rcosy)r? siny dr dy dé. 
Q Q! 


Here, too, the roles of the variables can be exchanged. 


Examples 8.30 


i) Let us compute 
i) (a? + y”) dx dydz, 
Q 


QQ being the region inside the cylinder x? + y? = 1, below the plane z = 3 and 
above the paraboloid x? + y? + z = 1 (Fig. 8.27, left). 
In cylindrical coordinates the cylinder has equation r = 1, the paraboloid t = 
1—r?. Hence, 

O0<r<l, 0<0<2r, 1—-r?<t<3 
and 


Qn pl 73 1 ig 
| (x? + y”) dx dydz = | | y r? dt dr dé = an f [re] dr 
Q 0 Jo J1-r? 0 a 


' 4 
=2n [ (2r3 + r°)dr = =r. 
0 3 


Figure 8.27. The regions relative to Examples 8.30 i) e ii) 
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ii) Find the volume of the solid defined by z > \/x2+ y? and x? + y? + 2? — 
z <0. This region 92 lies above the cone z = \/x? + y? and inside the sphere 
ge? + yy? + (z — a = + with centre (0,0, t) and radius 4 (Fig. 8.27, right). 


The volume is given by [ q dx dy dz, for which we use spherical coordinates. Then 


0<0<2n, 0S est. 


The bounds on r are found by noting that the sphere has equation r? = r cosy, 


i.e., r= cosy. Therefore 0 < r < cosy. In conclusion, 


2Qr pm/4 pcosyp 
vol(Q) = I dedydz = | | | r? siny dr dy dé 


2 [ aye 4 e=n/4 
==-7 cos * sin pdy = — =n cos o| = 
a dg 6 


T 
y=0 8 


8.5 Applications and generalisations 


This last section is devoted to some applications of multiple integrals. We will 
consider solids of revolution and show how to compute their volume, and eventually 
extend the notion of integral to cover vector-values functions and integrals over 
unbounded sets in R”. 


8.5.1 Mass, centre of mass and moments of a solid body 


Double and triple integrals are used to determine the mass, the centre of gravity 
and the moments of inertia of plane regions or solid figures. Let us consider, for 
a start, a physical body whose width in the z-direction is negligible with respect 
to the other dimensions x and y, such as a thin plate. Suppose its mean section, 
in the z direction, is given by a region 2 in the plane. Call u(x,y) the surface’s 
density of mass (the mass per unit of area); then the body’s total mass is 


(8.17) 


The centre of mass (also known as centre of gravity, or centroid), of 2 is 
the point G = (xg, yg) with coordinates 


1 1 
ne = — | tu(z,y)drdy, ye= — | yu(x,y) da dy . (8.18) 
mM JQ m IQ 


Assuming the body has constant density (a homogeneous body), we have 
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1 1 
a dd ee ds 
a —ail: eS ae <a ee 


the coordinates of the centre of mass of 2 are the mean values (see (8.9)) of the 
coordinates of its generic point. 
The moment (of inertia) of 2 about a given line r (the axis) is 


2 


where d,-(x,y) denotes the distance of (x,y) from r. In particular, the moments 
about the coordinate axes are 


i= || y? w(x, y) da dy, I = f 2 ulz,y)dedy. 
2 2 


Their sum is called (polar) moment (of inertia) about the origin 


Ip = Ip ly = | (x? + y?)u(a, y) dady = | dj (x,y) u(x, y) da dy , 
22 2 


with do(x,y) denoting the distance between (az, y) and the origin. 
Similarly, for a solid body Q in R? with mass density p(2, y, z): 


1 
m= | u(x, y, 2) dx dydz, LG = — | ru(z, y,z)daxdydz, 
Q UE 


1 1 
ve=— | yx, y, z) dx dy dz, a= = | z(x,y, 2) dar dy dz. 
mila mIQ 


Its moments about the axes are 
i= | (y? + 27) u(x, y, z) da dy dz, 
Q 
Le= i (x? + z*)u(a, y, z) da dy dz, 
Q 


i= | (x? + y*)u(x,y, 2) dx dy dz, 
2 


and the moment about the origin is 


hh=h+i,+i,= | (a? + y? + 2”) u(x, y) da dy. 


Q 
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Figure 8.28. The set (2 relative to Example 8.31 


Example 8.31 


Consider a thin plate 2 in the first quadrant bounded by the parabola x = 1—y? 
and the axes. Knowing its density u(x,y) = y, we want to compute the above 
quantities. 

The region 2 is represented in Fig. 8.28. We have 


1 1—-y? 1 1 
m= | ydeay= | | dx yay= f yd—y)ay= 3, 
‘¢) 0 0 (0) 4 


8.5.2 Volume of solids of revolution 


Let (2 be obtained by rotating around the z-axis the trapezoid T determined by the 
function f : [a,b] > R, y = f(z) in the yz-plane (Fig. 8.29). The region T is called 
the meridian section of 2. The volume of 92 equals the integral [ q dzdydz. 
The region (2,,, intersection of 2 with the plane z = 2p, is a circle of radius f (zo). 
Integrating over the slices 2, and recalling the notation of p. 325, then 


vol(@) -f (/ ardy) dan fl (f(z))* dz, (8.19) 


z 
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x 


Figure 8.29. A solid of revolution 


since the double integral over A, coincides with the area of 2,, that is m( f (z))”. 
Formula (8.19) can be understood geometrically using the centre of mass of the 
section 7’. In fact, 


Jrydy dz 1 apt?) il 2 2 
= dy dz = ———_—~— dz. 
_ J, dydz — area(T) I] ee , ay ae 


Using (8.19) then, 


—_ vol({2) 
2e Oi area(T) 
or 
vol(2) = 2m7yq area(T) . (8.20) 


This proves the so-called Centroid Theorem of Pappus (variously known also as 
Guldin’s Theorem, or Pappus—Guldin Theorem). 


Theorem 8.32 The volume of a solid of revolution is the product of the area 


of the meridian section times the length of the circle described by the section’s 
centre of mass. 


This result extends to the solids of revolution whose meridian section T is not 
the trapezoid of a function f, but instead a measurable set in the plane. The 
examples that follow are exactly of this kind. 


Examples 8.33 


i) Compute the volume of the solid (2, obtained revolving the triangle T of 
vertices A = (1,1), B = (2,1), C = (1,3) in the plane yz around the z-axis. 
Fig. 8.30 shows 7’, whose area is clearly 1. As the line through B and C is 
z= —2y+5, we have 
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* I 
ra eee C 
z=—2y+5 
t,o Sa 
A| | B 

| | 

| | 

| | 

: : . 

1 2 y 


Figure 8.30. The triangle T in Example 8.33 i) 


27 poate 
vol(2) = 27yg = an f ydydz = an | (/ de) udy 
fe 1 Vi 


‘ 8 
=4n [ (2 —y)ydy = 3: 
1 


ii) Rotate the circle T given by (y — yo)? + (z — 20)? = 7?, 0 < r < yo, in the 
plane yz. The solid 2 obtained is a torus, see Fig. 8.31. Since area(T’) = mr? 
and the centre of mass of a circle coincides with its geometric centre, we have 


vol(Q) = 21? ry . Oo 


Figure 8.31. The torus relative to Example 8.33 ii) 
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8.5.3 Integrals of vector-valued functions 


Let f : 2 Cc R” > R”™ be a vector-valued map defined on a measurable set (2 


m 
(with n, m > 2 arbitrary); let f = » fie; be its representation in the canonical 
i=1 


basis of R™. 
Then f is said integrable on 2 if all its coordinates f; are integrable, in which 
case one defines the integral of f over (2 as the vector of R™ 


For instance, if v is the velocity field of a fluid inside a measurable domain Q C R°, 


then ‘ 
va=— | vdQ2 
|2| Ja 


is the mean velocity vector in 2. 


8.5.4 Improper multiple integrals 


We saw in Vol. I how to define rigorously integrals of unbounded maps and integrals 
over unbounded intervals. As far as multivariable functions are concerned, we shall 
only discuss unbounded domains of integrations. For simplicity, let f(x,y) be a 
map of two real variables defined on the entire R?, positive and integrable on 
every disc Br(0) centred at the origin with radius R. As R grows to +00, the discs 
clearly become bigger and tend to cover the plane R?. So it is natural to expect 
that f will be integrable on R? if the limit of the integrals of f over Br(0) exists 
and is finite, as R + +00. More precisely, 


Definition 8.34 Let f : R? — R be a positive and integrable map 
over any disc Br(0). Then f is said integrable on R? if the limit 

lim i f(x,y) dxady exists and is finite. The latter, called improper 
integral of f on R?, is denoted by 


[fle ae dy = tim f(x,y) dx dy. 
R2 R--+00 Br(0) 


Remark 8.35 i) The definition holds for maps of any sign by requiring | f(z, y)| 
to be integrable. Recall that each map can be written as the difference of two 
positive functions, its positive part f4(a,y) = max (f(a, y),0) and its negative 
part f_(«,y) = max (— f(#,y),0), ie, f(e,y) = fy(a,y) — f-(,y); then 
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da dy = da dy — _(x,y) dx dy. 
[fen x dy [fen x dy [Ls (x, y) dx dy 


ii) Other types of sets may be used to cover R?. Consider for example squares 
Qr(0) = |—R, R] x [—R, R] of base 2R, all centred at the origin. One can prove 
that f integrable implies 


lim f(x,y) dady = _lim f(x,y) da dy. 
R-+>+00 Br(0) ( ) R->+00 Qr(0) 


iii) If the map is not defined on all R?, but rather on an unbounded subset 2, one 


considers the limit of : f(x,y) da dy as R + +oo. If this is finite, again by 
Br(0)N2 
definition we set 


f(x,y) dxdy = lim f(x,y) dx dy. 
R++00 J Bp(0)NQ - 


Examples 8.36 
1 
i) Consider f(x, y) = ———————., a map defined on R?, positive and con- 
(3 + 22 + y?)8 
tinuous. The integral on the disc Br(0) can be computed in polar coordinates: 


Qn pR 
i f(x,y) dady = / / Jar = yi dr dé 
= —2n|(3 + aad = 2n( — ==) 


Therefore 
1 20 
(x,y) dx dy = pad _on( 5) = 
4 pean ay V3 V3B+R2) VB 
making f integrable on R?. 


ii) Take f(z, y) = Proceeding as before, we have 


1 
ee ae 


QT 
| Fy) dedy = | ie ——, rd 
Br(0) 


R 
= r | log(1 + ay = wlog(1+ R?). 
Thus 


. _ . 2 _ 
plim | a f(z,y) dady = plim 7 log(1 + R*) = +00 


so the map is not integrable on R?. 


iii) With what we have learned we are now able to compute an integral, of 
paramount importance in Probability, that has to do with Gaussian density: 


+00 . 
s= | eda. 
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Let f(x,y) = e-* —¥" and observe 


| eo?" dr dy = (| er ar) (| ew ww) = $7. 
R2 R R 


On the other hand 


apt? F gy? : Se i =? 
e Y dady = lim e ¥ dxrdy = lim e ' rdrdé 
R2 R--+00 Br(0) R-+00 0 0 


R 
= lim nf -e" | = lim nl—e* )=n. 
0 R-++00 


R-++00 
Therefore 


8.6 Exercises 


1. Draw a rough picture of the region A and then compute the double integrals 
below: 


mr 
a) [a Te dzdy where A= [1,2] x [1,2] 


b) | xy dx dy where B = {(z,y) € R?: 2 € [0,1], 2? <y<1+2} 
B 


) [er drdy — where C = {(z,y) € R?: 223 < y < 2,/z} 
C 


d) [ veavay where D = {(z,y) € R?: y € [0,1], y< a2 <e¥} 

le) [ y° dx dy where £ is the triangle of vertices (0,2), (1,1), (3, 2) 
| f ef aray where F = {(z,y) €R?:0<y<10<2<y} 
12), | weosy dr dy where G is bounded by y = 0, y= x7, x = 1 


/ ye” dx dy where H is the triangle of vertices (0,0), (2,4), (6,0) 
H 


2. Let f(x,y) be a generic continuous map on R?. Find the domain of integration 
in the plane and change the order of integration in the following integrals: 


| [ [ renavar Df sewaver 
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Y 4 
1 


Figure 8.32. The sets A and B relative to Exercise 4 


Ol ff se weyas a) ff tenparay 
4 p2 1 pr/A 
‘ y/2 fay) oo [ [te ¥) ee 


3. Draw a picture of the integration domain and compute the integrals: 


i pris eal 
sl Xv 2 
a) 1 dx dy b) | | e ” dady 
1 pl i pal 
lal f f NE aya d) [J eY dy da 
0 Ja LTY 0 J Ve 
3 79 1 a /2 
e) | / y cos. x7 dx dy fa] ff cos x2\/ 1 + cos? x dx dy 
0 y? 0 arcsin y 


4. Referring to Fig. 8.32, write the domains of integration as unions of normal 
domains, then compute the integrals: 


a) fa? aray b | euaedy 
: : 


Compute | y dx dy over the domain A defined by 
A 


xg a, << 2, O0<y<z 


as R > 0 varies. 


6. As R > 0 varies, calculate the double integral 7 x da dy over A defined by 
A 


Se +a. O0<a<1, =< y=. 


Compute 


10. 


. Determine 


. For any real a calculate 
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| x sin |x? — y| da dy 
A 


where A is the unit square [0,1] x [0,1]. 


i |sinx — y| da dy 
A 


where A is the rectangle [0,7] x [0, 1]. 


/ ye ole—¥"| dx dy 
A 


where A is the square (0, 1] x (0, 1]. 


Draw a picture of the given region and, using suitable coordinate changes, 


compute the integrals: 


(c)) | (ar + 4y?) ara 
Cc 
d) | tyacay 
p 
e) [oe acay 
E 


f) | arctan & da dy 
F x 


da dy 


x 
s) lwo 
fa] f x dx dy 
H 


where A is bounded by y = 22, y = 227, 


where 


where 


where 


where 


where 


where 


where 


and by « = y”, x = 3y? 


B, in the first quadrant, is bounded by 


Y=U, Y= 32, ry = 2, ry =6 


C ={(z,y) € 


D ={(#,y) € 


R? :y>0,1<2?+y? <4} 


R? : a2 + y? <9} 


E is bounded by x = \/4— y? and x = 0 


F = {(a,y) € 


Peleg +o <4, 


ly| < |x|} 


G ={(z,y) €R :y>0,¢+y>0, 


H = {(z,y) € 


3 <a? +y? <9} 


R? ay > 0, a7 +47 <4, 


x? + y? — 2y > 0} 
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By transforming into Cartesian coordinates, compute the following integral in 
polar coordinates 


1 
————————— dr dé 
A’ \/cos? 6+ sin 6/r + 1/r? 


where A’ = {(r,0):0<0< ars 


12. By transforming into Cartesian coordinates, compute the following integral in 
polar coordinates 


/ log r(cos @ + sin @) drdd 


cos 6 


where A’ = {(7,0):0<0< 7 1<rcosé < 2}. 


Use polar coordinates to write the sum of the integrals 


1 x V2 px 2 V4—22 
| / ry dy dr + | Z ydy ar + | | xy dy dx 
1/V2 J /1=2? 1 0 V2 /0 


as a single double integral, then compute it. 


Determine mass and centre of gravity of a thin triangular plate of vertices 
(0,0), (1,0), (0,2) if its density is u(a,y) =14+3xr+4+y. 


A thin plate, of the shape of a semi-disc of radius a, has density proportional 
to the distance of the centre from the origin. Find the centre of mass. 


Determine the moments I, Iy, Io of a disc with constant density (x,y) = py, 
centre at the origin and radius a. 


Let D be a disc with unit density, centre at C = (a,0) and radius a. Verify 
the equation 
Ip =I +0°A 


holds, where Ig and I¢ are the moments about the origin and the centre C, 
and A is the disc’s area. 


18. Compute the moment about the origin of the thin plate C defined by 
C={(2,y) €R?: 27 +y? <4, 2? +y*?-a2>0} 


knowing that its density (x,y) equals the distance of (x,y) from the origin. 
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A thin plate C occupies the quarter of disc x? + y? < 1 contained in the first 
quadrant. Find its centre of mass knowing the density at each point equals the 
distance of that point from the x-axis. 


20. A thin plate has the shape of a parallelogram with vertices (3,0), (0,6), (—1, 2), 
(2,—4). Assuming it has unit density, compute I,, the moment about the 
axis Y. 


21. Determine the following multiple integrals: 


a) [ow —z°)dxdydz where A= [-—1,1] x [0,1] x [0,2] 
A 


b) | 2yaeayae where B={(z,y,z)€R2:0<a2<2z<2, 
: O<y<V4-27} 
c) / azsiny°dxdydz where C = {(z,y,z)€R?:0<2<y<1, 
: y Sz < 2y} 
a y dx dy dz where D, in the first octant, is bounded by 
the planesx+y=l,y+z=1 
[ y dx dy dz where FE is bounded by the paraboloid 
y = 4a? + 42? and the plane y = 4 
i: zdax dy dz where F’, in the first octant, is bounded by the 


plane y = 32 and the cylinder y? + z? = 9 


22. Write the triple integral | f(x, y, z) dx dy dz as an iterated integral in at least 
A 
three different ways, with A being the solid with lateral surface: 


a)a=0,2=2,y2=1-z b) 2? +9? =9,2=0,2=6 


Consider the region A in the first octant bounded by the planes x+y—z+1 = 0 
and x + y = a. Determine the real number a > 0 so that the volume equals 
vol(A) = 2. 


Compute the volume of the region A common to the cylinders x? + y? < 1 and 
go +2" < 1, 


Compute 


1 
j da dy dz 
A 3-2z 
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over the region A defined by 
9z<1+y?+92", O<2< J/9- (y? + 922). 
The plane x = K divides the tetrahedron of vertices (1,0,0), (0,2,0), (0,0, 3), 


(0,0,0) in two regions C;, C2. Determine K so that the two solids obtained 
have the same volume. 


By suitable variable changes, compute the following integrals: 


| / Vx? + y? dx dy dz 
A 


with A bounded by the cylinder x? + y? = 25 and 
the planes z = —1, z=2 


De 


with B bounded by the cylinders x? + z? = 1, x? + z2 = 4 and 
the planes y = 0 and y= z+2 


C) [e+ aarayas 
@ 
with C ={(2,y,z) ER8:1<2<2,2¢7+y?4+27 <4} 


ad) | eardya: 


with D in the first octant and bounded by the spheres «7+ y?+ z?7=1 
and 27+ y?+27=4 


e) | Vu? + y? + 2? dx dy dz 
E 


with E' bounded below by the cone y = % and above by the sphere r = 2 


Compute the following integrals with the help of a variable change: 


1 V1—2z2 2-47 — 2? 
af I. ol. 2 (a? + z7)3/? dy dx dz 
Jo-y? y? 18— /18— 2? —y? 
» [ i i (x? + y? + 27) dzda dy 
Jfar+y? 


Calculate the integral 


16 
| (42° + yy? + a) daz dy dz 
Q 9 


ae 2 2 
where §2 is the solid bounded by the ellipsoid = —-+ 7 + 7 =) 
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Find mass and centre of gravity of Q, the part of cylinder x7+y? < 1 in the first 
octant bounded by the plane z = 1, assuming the density is p(x, y,z) =“+y. 


What is the volume of the solid (2 obtained by revolution around the z-axis of 


T={(y,z)e R722 <y <3,0<2<y, 7 =277 <4)? 


32. The region T, lying on the plane xy, is bounded by the curves y = x”, y = 4, 
x =0 with 0 < x < 2. Compute the volume of §2, obtained by revolution of T 
around the axis y. 


Rotate around the z-axis the region 


T={(2,z)€R?:sinz<a2<4-2z,0<2z<7} 


to obtain the solid (2. Find its volume and centre of mass. 


8.6.1 Solutions 


1. Double integrals: 


a) The region A is a square, and the integral equals 


T= 


ON le 


L. .. 8 
(7log2 — 3log5 — 2arctan2 — 4 arctan 5 + 5") : 


b) The domain B is represented in Fig. 8.33, left. The integral equals J = 2. 


V4 


y=1lt+ea 


Figure 8.33. The sets B relative to Exercise 1. b) (left) and C to Exercise 1. c) (right) 
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Figure 8.34. The sets D relative to Exercise 1. d) (left) and EF to Exercise 1. e) (right) 


c) See Fig. 8.33, right for the region C. The integral is J = 32. 


d) The region D is represented in Fig. 8.34, left. The integral’s value is J = 
4.3/2 _ 32 
ae 45° 


e) The region F can be seen in Fig. 8.34, right. The lines through the triangle’s 


vertices are y = 2, y = (x +1) and y = —x + 2. Integrating horizontally with 
the bounds 1 < y <2 and2—y <2 < 2y—1 gives 


ee ee 147 
[ varay = | (| ide) dy = —. 
E 1 2-y 20 


f) The region F’ is shown in Fig. 8.35, left. Let us integrate between 0 < y < 1 
and 0 < x < y to obtain 


2 ? y 2 1 2 
fe de dy = [ (| eo! a) ay= | e” [x]6 dy 
F 0 \Jo 0 


Figure 8.35. The sets F relative to Exercise 1. f) (left), and G to Exercise 1. g) (right) 
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2 ies 
Figure 8.36. The set H relative to Exercise 1. h) 


¢) The set G is represented in Fig. 8.35, right. We integrate vertically with 0 < 
ecole yg 


1 x 1 : 
[ wcosyaray = | | x cos y dy d= | x[siny], dx 
G 0 \Jo 0 


: 1 ~ al 
=I asin x? da = [cos] = -(1-—cosl). 
; 2 ae 


h) The region H is shown in Fig. 8.36. The lines passing through the vertices 


of the triangle are y = 0, y = —x% +6, y = 2a. It is convenient to integrate 
horizontally with 0 < y <4 and y/2 < «<6 -—vy. Integrating by parts then, 


4 6-y 
| wer aray = | / ye” dz | dy =e° — 9e? — 4. 
H 0 y/2 


2. Order of integration: 


a) The domain A is represented in Fig. 8.37, left. Exchanging the integration 


order gives 
1 pa i i 
[ f tenaver= fof few aray, 
0 JO O vy 


b) For the domain B see Fig. 8.37, right. Exchanging the integration order gives 


mw/2  psina 1 pr/2 
ff tenes ff fewaray. 
0 0 0 arcsin y 
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x x 


Figure 8.37. The sets A relative to Exercise 2. a) (left) and B to Exercise 2. b) (right) 


c) 


f) 


The domain C’ is represented in Fig. 8.38, left. Exchanging the integration 


order gives 
2 log x log 2 2 
[ [ tenaae= ff teyarey. 
1 0 0 ey 


The region D is drawn in Fig. 8.38, right. In order to integrate vertically, we 
must first divide D in two parts, to the effect that 


[fo tenacav= [Pf senayars f° [ penayae. 


The domain F is given in Fig. 8.39, left. Exchanging the integration order 


gives 
4 2 2 p2x 
| F(e,y) andy = ff f(t, y) dy de. 
0 vy/2 0 JO 


F is represented in Fig. 8.39, right. Exchanging the integration order gives 


1 pr/4 m/4 ptany 
ff  fewevar= ff peyacey. 
0 arctan «© 0 0 


x 


Figure 8.38. The sets C relative to Exercise 2. c) (left) and D to Exercise 2. d) (right) 
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ALA 


y = arctan z 


2 x 1 2 


Figure 8.39. The sets FE relative to Exercise 2. e) (left) and F' to Exercise 2. f) (right) 


3. Double integrals: 


a) The domain A is normal for x and y, so we may write 


A={(#,y)€R?:0<y<l1,ry<a<n} 


={(,y)€R’:0< 2S, 0<y<—}. 


See Fig. 8.40, left. 
Given that the integrand map is not integrable in elementary functions in 2, 
it is necessary to exchange the order of integration. Then 


1 T as T xu/m wT 
| / sin x indy = | sin x ay \ ax -| ane ie!" an 
0 dry & 0 0 tv 0 tv 


1 [7 2 
~ | sinzdx = —-. 
T 0 T 


b) The domain is normal for x and y, so 


B={(z,y)¢R’:0<y<ly<a¢<]} 


= {(z,y) © R?:0<2<1,0<y<a}. 


Figure 8.40. The sets A relative to Exercise 3. a) (left) and D to Exercise 3. d) (right) 
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It coincides with the set A of Exercise 2. a), see Fig. 8.37, left. 
As in the previous exercise, the map is not integrable in elementary functions 
in x, so we exchange the order 


tpt. 2 1 ey 1 ; l 
| / e ” dxady -| (| e* ‘y) dx =) se” dx = =(1—e"). 
0 Jy 0 0 0 2 


c) The integration domain is normal for x and y, so we write 


C= {(z,y) €R?:0<a2<1,2¢<y<1} 


= {(r,y) CR? :0<y<1,0<a2<y}. 


It is the same as the set F’ of Exercise 1. f), see Fig. 8.35, left. 
As in the previous exercise, the map is not integrable in elementary functions 
in y, but we can exchange integration order 


1 1. 1 y 
I / SY ayar = | (| Ear) ay 
Ge abo gy 0 90 “ery 
T 


1 y 1 
1 x T 1 
= 7 = arctan =| ay=7 | —dy= 
I viz vo LJ; ig 2 


d) The domain D is as in Fig. 8.40, right, and the integral equals J = (e — 1)/3. 
e) The domain F is represented by Fig. 8.41, the integral is J = + sin el. 
f) The domain is normal for x and y, so 


F = {(x,y) €R?:0<y <1, aresiny <2 < 5} 


={(c,y)€R:0< 25 5,0<y<sing}. 
It is precisely the set B from Exercise 2. b), see Fig. 8.37, right. 


As in the previous exercise the map is not integrable in elementary functions 
in x, so we exchange orders 


1 pr/2 m/2 sin x 
fl / cosnVT Fos? edrdy = | (/ cour VTyeoay dx 
0 arcsin y 0 0 


x 


9 


Figure 8.41. The set F relative to Exercise 3. e) 
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aye i i/F = 
il sinz cos x1\/ 1+ cos? xdx = >| JVtdt = 2v2— 1 
0 1 


3 
4. Double integrals: 
a) The integral is 1. 
b) Referring to Fig. 8.32 we divide B into three subsets: 
B, = {(a,y) € R?:-l<2<0, —a2<y<1} 
By = {(a,y) © R?:0<y<1,0<a<1+y?} 
Bs = {(a,y) € R?:-1<y <0, VW-y<a2<1+y’}. 


After a little computation, 


[ evaray = | cyde dy + | syd dy + | xy dx dy 
B B Bs 


1 Bo 

0 1 1 pl+y? 0 1t+y? 
/ i ydy ar + | : cy andy + | / xy dx dy 
—1 J—2? 0 Jo 1) /~-y 


=), 
5. When 0 < x < 2 the circle x? + y? = R? and the line y = x meet at P = 
(3: rae Hence for a > 2, i.e, R > 2/2, A is empty and the integral is 0. 


So lettO< R< reve? and we have to distinguish two cases: 0 < R < 2 and 
2<R< 2v2. See Fig. 8.42 for A. 
Take 0 < R < 2: using Fig. 8.42 we can write 


J yoray = | yardy + | y dx dy 
A Aj Ag 


Y 


0O<R<2 


Figure 8.42. The set A relative to Exercise 5 
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R x 2 x 
| / yayde + [ | y dy dx 
R/V2 J JR? —2? R Jo 
R 2 


1 1 4 1 
sf 2 Rydee 5 f aPar= 5+ TR V3—2). 
2 J R/V2 2 JR 3. 6 


Now suppose 2< R< 2\/2: then 


2 x 
[ vara = | / y dy dx 
A R/V3 J VRo aa? 


1 1 


= 5 [08 R de = 3 — Re aR 
6. We have 
ve Tro<A< 1. 
[ carey = V2 pe — Lent —1) if1<R< v2, 
5 ifR> V2. 


7. The integrand changes sign according to whether (x,y) € A is above or below 
the parabola y = x? (see Fig. 8.43). Precisely, for any (x,y) € A, 


rsin a? —y| = {Nw ow ify<e" , 
—axsin(z*—y) if y>a?. 


Figure 8.43. The set A relative to Exercise 7 
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Looking at Fig. 8.43 then, we have 


| asin |2? — y| dx dy = | ax sin(a” — y) dx dy — / ax sin(x? — y) dx dy 
A Aj Ag 


1 px? 1 pl 
-| | rsin(a? — y) dydx ~ [ i a sin(x” — y) dy dx 
0 Jo 0 Ja 
1 1 


= fF [ecoste? alg’ ae J [rcoste? —y)]a ae 


0 0 
1 1 
=, v(1 —cos*) de — f x(cos(x* — 1) — 1) dx =1-sin1. 
0 0 


8. The result is J = 7 — 2. 


9. We have 
‘ ; 
, —=(e*-l+a) ifa0, 
[ yet lazay = . 
e ! ifa=0 
= ifa=0. 
2 


10. Double integrals: 

a) The region A is bounded by four parabolas, see Fig. 8.44, left. Set u = a — 
my which define the transformation (u,v) = W(x,y); with that change, A 

Rents the rectangle A’ = [1,2] x [1,3]. The Jacobian of © reads 


Een ee —2y/x2 1/2? 
Ly? = 2a/y? 
YU 4 
y=a 
y = 22” 4 
L=y a 
A 
x = 3y? ry =6 
x e 
£ 


Figure 8.44. The sets A relative to Exercise 10. a) (left) and B to Exercise 10. b) (right) 
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with 4 i 3 
= _ ee 
det JV(x, y) = ae ap ae gu Us 
therefore, if = W—!, we have det J®(u,v) = Aer Thus 


Z 
l= raedy= fof we vag du du ==. 


b) B is bounded by 2 lines and 2 hyperbolas, see Fig. 8.44, right. 
Define (u,v) = V(z,y) by u = zy, uv = Y Then B becomes the rectangle 
"y 
B' = [2,6] x [1,3], and 
y t y 
JW = det JW =2-=2u. 
(x,y) ee / ; et IW (x,y) = 2° = 2u 
Calling 6 = W~!, we have det J®(u, v) = 1/2, so 


ry? ae) | 1 3 f° u? 
—— dedy = —_ — dudv = =[1 2 q 
es ete | ea Ww slog}: [(u 25) . 


6 


1 1 9 
—-] 1 — log 3(208 + log —— 
Tales og(u> + | - 5 108 (208 + log an): 


1 1 
a 
5 og 3 oe 


c) The region C is shown in Fig. 8.45, left. 
Passing to polar coordinates (r,0 ), C transforms into C’ = [1,2] x [0,7], so 


T 2 
[r+ 4y)deay = | [Gr cosa + 4r*sin® or ar ao 
Cc 


= a r?cos@ + r*sin 2g]: a= [ (7 cos + 15 sin? 0) dé 
0 0 


1 
ay (70s 5 (1 — 00526) ) do= Sr. 
0 


Kad 
Kad 


Figure 8.45. The sets C relative to Exercise 10. c) (left) and D to Exercise 10. d) (right) 
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y Ya 


Figure 8.46. The sets E relative to Exercise 10. e) (left) and F' to Exercise 10. f) (right) 


d) For D see Fig. 8.45, right. The integral is J = 0. 


ce) Fig. 8.46, left, shows the region EZ, and the integral is J = $(1 —e~*). 


& 
h 


) 
) 
f) F is represented in Fig. 8.46, right, the integral is J = 0. 
. . . . = 3 2 
) Gis shown in Fig. 8.47, left. The integral equals J = 3y2 
) 


The region H lies in the first quadrant, and consists of points inside the circle 
centred at the origin of radius 2 but outside the circle with centre (0,1) and 
unit radius (Fig. 8.47, right). In polar coordinates H becomes H’ in the (r,0 )- 
plane defined by 0 < 6 < § and 2sin@ < r < 2; that is because the circle 
x? + y? —2y > 0 reads r — 2sin@ > 0 in polar coordinates. Therefore 


mw/2 p2 1 am /2 . 
| rdzdy = [ i rcosdrd = = f cos 6[r*] 5... 9 44 
A 0) 2 sin 0 3 0 


8 n/2 8 1 am /2 
=; / (cos @ — cos @sin? 0) dé = = |sin@ — — sin’ 6 = 2. 
ay, 3 4 ; 


11. If we pass to Cartesian coordinates, the condition 0 < 6 < 2 means we 


2 
only have to consider points in the first quadrant, while r = 44 andr = +, 


Ya Ys 


V3 3 2 x 


Figure 8.47. The sets G relative to Exercise 10. g) (left) and H to Exercise 10. h) (right) 
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correspond to the lines x = 1, y = 1. In the plane (x,y) then, A’ becomes the 
square A = [0,1] x [0,1], so 

1 


1 
dr df = ———————————————— 
A’ /cos? 6+ sin 8/r + 1/r? a Vr? cos? 6 +rsiné +1 


r dr dé 


da dy 


1 a 1 
easers a o Jo Vz2?t+ytl 
A 1 1 
=2f [Va y+] ar =2 | (Va? +2— Va? +1) dr 
0 0 


Va? +2 Vez+1 1 
= a states $2) — A — Sogn + a? +1) 
0 
2+ V3 
= 73—/2+1o . 
aS 


12. J = Alog2 — 2. 


13. Note x € [Za 2] and the curves y = V1 — x? and y = V4 — 2? are semi-circles 
at the origin with radius 1 and 2. The region A in the coordinates (x,y) is made 
of the following three sets: 


Ay = {(a,y) €R?2:1<a2< V2,0<y<2} 
Az = {(a,y) €R?: V2<4<2,0<y< V4-27}; 


see Fig. 8.48. 


Ai 


Figure 8.48. The set A relative to Exercise 13 
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In polar coordinates A becomes A’ = {(r,0):0<0< 4,1<r <2}, so 


w/4  p2 is 
=| r'sindcos#drde = [ | r? sin@cos@ dr dé = —. 
Al 0 1 16 


14. The thin plate is shown in Fig. 8.49, left, and we can write 


A={(a,y) €R?:0<2<1,0<y< 2-22}. 


Hence 


1 2-224 
8 
m(A) = | n(e.y)deay = f i (1+ 32+y)dyde ==, 


1 3 pl p2-2e 3 
veld) = oy | an(e.u)dvdy = 5 f / a(1 +32 +y)dyda =<, 


(A) : ] (x,y) dad app (1+3x2+y)dyd - 
— —_ L fd 
YB m(A) Fa L,Y) ar ay 8 eae Yy y) ay 16” 


3 #11 
3) ig): 


15. The thin plate A is the upper semi-circle of x? + y? = a? (Fig. 8.49, right). 
The distance of (x,y) from the centre (the origin) is \/x? + y?, whence the 
density becomes u(x, y) = K\/x? + y?, with given K > 0. In polar coordinates, 


so the centre of mass has coordinates ( 


Krad? 


m(A) =f ue,yydrdy = f | Kr? dr dé = : 
A 0 JO 


Since the thin plate and its density are symmetric with respect to y, the centre of 
mass has to lie on the axis, so xg(A) = 0. As for the other coordinate: 


1 3 ae 3a 
A = = —_—— 3 as Q — . 
yB(A) ray | males) ded aa | / Kr? sin @ dr dé = 


Therefore the centroid’s coordinates are (0, 3*). 


y= 2-24 


1 x —a a ax 


Figure 8.49. The sets A relative to Exercises 14 (left) and 15 (right) 
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16. In polar coordinates 


20 a 
lo = f (a? + y?)wdzdy =u | | 3 drd@ = Xpa4 
D 0 Jo 2 


By symmetry, I, = I, so the equation Ig = I; + Iy gives immediately 


Ip =I, = 


17. Fig. 8.50 shows the disc D, whose boundary has equation (x — a)? + y? 
or polar r = 2acos@. Thus 


mw /2 2a cos @ 
Ip = | (2? +y?)drdy = | r? dr dé 
D —n/2/0 


an /2 mw /2 3 
= if a’ cos* 6 d@ = sa" | cos* 6d9 = =ra’*, 
mw /2 0) 


To = f We-a)? +P Jdvdy = f (+ 92) dtdy 


Qn a 1 
= | re drdé@ = =na’*. 
0 0 2 


But as A = 7a”, we immediately find Ip = Ig + a? A. 
18. lbp = Sr -2 


ce 


19. Since p(x, y) = y we have 


m(A) = f yazay = | | r’sinOdrdd = 5, 
C 0 0 


Figure 8.50. The disc D relative to Exercise 17 
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m/2 

tp(A) = 3 | rydedy =3 | [ r *sin8 cos dr dd = = 
G 

a /2 3 

yp(A) = af y dedy =3 f i r? sin? 6dr dé = — 67 


20. Iy = 33. 


Q 


21. Triple integrals: 
a) [= —-8. bl =4, c) I= 3(1 —cos 1). 


d) We have 
1 l-y l-y 
| yezayae = [ (| ax | te) d 
D 0 0 0 


1 " 1 
= [= = ——: 
/ y(l — y)° dy 7) 


See Fig. 8.51, left. 
e) Since 


4 
| yarayaz = | (/ tod) 
E 0 Ey 


with Ey, = {(a,z) € R?: 2? +z? = 4}, the integral Sn, da dz is the area of 
Ey, hence 74. Therefore 


4 
16 

| vacayaz =F | y’ dy = —7, 

E 4 Jo 3 


see Fig. 8.51, right. 


Ey x {y} 


Ly 


Figure 8.51. The regions relative to Exercise 21. d) (left) and Exercise 21. e) (right) 
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f) We have 


| zavayaz= | | da i dy | zdz = —. 
F 0 0 0 4 


See Fig. 8.52, left. 


22. Triple integrals: 
a) We have 


1-y? 
r= / a f(2.9,2) Jaedeay =f _ _[ f(x,y, z) da dy dz 
i 4 a 
=) I J rei ie Jaydzae = fo [fT f(x,y, z) dy daz dz 
6) x —J/1-z 
1-y? pl— sf I “ 
=f 7 [ RAG re! re Jazdnay= ff” _| f(x,y, z)dzdydz. 


b) Simply computing, 


3 6 pr/9—y? 6 3 pr/9=y? 
i, i | (22 ded dy = [ i / f(x,y, z) dx dy dz 
=3/0 J=./o-y? ) 0 J—3 J—,/9—y? 
9-— _ q— a 
=f - | (a5 2 Jaydzae = [ i |_— (x,y, z) dy dx dz 
9— — 


_ 


=f fm ff few yacdeay= ff [rene dz dy de. 


J9—a2 


Figure 8.52. The regions relative to Exercise 21. f) (left) and to Exercise 23 (right) 
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23. Integrating in z first, withO<z<a+y+1, gives 


zt+tyt+l 
vol(A) =e dz da dy 
D Jo 


where D = {(z,y) € R?:0<2<a,0< y<a—2y}, see Fig. 8.52, right. The 
computations yield 


a a—x 1 1 
vol(A) -| ; (c+ty+1)dydz= 5a + 30°. 
0 Jo 


3 
Imposing vol(A) = 2 we obtain 
1 1 5 
50 + 3 =5 ie, (a —1)(2a? + 5a +5) =0. 


In conclusion, a = 1. 


24. By symmetry it suffices to compute the volume of the region restricted to the 
first octant, then multiply it by 8. Reducing the integral, 


i pee ie 16 
vol(A) =s/ i | dzdxdy = —. 
0 Jo 0 3 


See Fig. 8.53. 


25. The surface 9z = 1+ y? + 92? is a paraboloid, whereas z = ,/9 — (y? + 9x?) 
is an ellipsoid centred at the origin with semi-axes a = 1, b = 3, c= 3. Then 


1 * 
dz dy dz = da dy } dz. 
As-2 0 3-2 Az 


For the integral inbetween brackets we distinguish 0 < z < $ and s <z<1.Inthe 
first case A, is an ellipse in (x,y) with semi-axes a = =v9 —z2 and b= V9 — 2? 


Figure 8.53. The region relative to Exercise 24 


360 8 Integral calculus in several variables 


Figure 8.54. The regions relative to Exercise 25 


(Fig. 8.5 = left); the integral is the area of A,, whence mab = a7 — z”). In case 
s <2 = 1, A; the region bounded by the ellipses 9x? + y? = 9z — 1 and 
9x? + y? = 9 — z? (Fig. 8.54, right) with 
J/9z-1 J/9— 22 
a =, b, = V9z-1, ag =, bg = V9 — 22. 
Therefore 


/ dx dy = magb2 — 7a1b1 = 7 (9 = 2) = " (92 —1). 
. 3 3 


Returning to the starting integral, 


7 1 ep ae ef C= O71 ae 
Ad 3 Jo 2-2 3 Jijg \ 3-2 + ae 


da dy dz = 
Zz 


26. Both volumes can be found by integrating in x the cross-sections: 


mies [ (/. ay dz) a -{ (/, ay dz) i= wale, 


The set A,, contained in (y, z), is 
Az = {(y,2z) €R?:0<y< 201-2), 0<2<3-32- su} 


Fig. 8.55 shows the product A, x {x}. We must thus have 


K p2(1-a2) p3-3a—sy 1 p2(1-a2) p3—3a—3y 
| i | dedyde = [ | | dzdydz, 
0 0 0 kK JO 0 
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Figure 8.55. The region relative to Exercise 26 


hence 
1-(1-K)?=(1-—K)*, 


from which K = 1-1/2. 
27. Triple integrals: 


a) In cylindrical coordinates A becomes 


Al = {(r,0,t) :0<7r<5,0<60<2n, -1<t<2}. 


Therefore 
2 ple -B 
| Ve FPavdyae= | | i r? dr dé dt = 2507. 
A -iJo Jo 


b) We shall use cylindrical coordinates with axis y, ie., © = rcos#, y =t, z= 
rsin@. Then (see Fig. 8.56) 


B={(@,y,z) €R!:1<2°427<4,0<y<2742} 
becomes 


Bo=1{00 71 <7 2,0 <0 < on, 04 < rend +2}5 
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Figure 8.56. The region relative to Exercise 27. b) 


therefore 
rsin@+2 
[ varayas = [" [ / r? cos 6 dt dr dé 
-[ [fr cos O(r sin 8 + 2) dr dé 
1 2) 3 2 : ae 
= [3 rt) E sin 26" + [Sr |, [sina] =); 
c) 3a. 


d) The region D is, in spherical coordinates, 


D' ={(r,9,0):1<r<2,0< 9,6 ie 


sO 


2 pr/2 pr/2 
[ eavdyae = f / | r® sin? y cos 6 dé dy dr 
D 1 Jo 0 
2 wm /2 a /2 
= (| r° ar) (/ coo (/ ss? pa] 
1 0 0 


e) 4(2 — /3)x. 


8.6 Exercises 363 


28. Triple integrals: 
a) In spherical coordinates 92 is defined by 


=Vl=22e@ <1, 2° 2 <9 9-77, =e 2< 1, 


The first and last constraints determine the inside of the cylinder x? + z? = 1, 
so cylindrical coordinates (along y) are convenient: 


r=rcosé, y=t, z=rsing. 
Thus (2 becomes 
Q ={(r,0,t):0<r<1,0<0 <2, 7? <t<2—r7} 


and the integral reads 


1 pla p2—-r? 
8 
[+ 2)Parayaz = | | / r’ dtd@dr = —r. 
Q 0 Jo Jr? 35 
b) 486 (/2 —1)q. 
29. Let us use generalised spherical coordinates 


x = 2rsinycosé, y = 3rsingsin6, z=A4rcosy 


where dx dydz = 24r? sin y dr dy dé; now 2’ is described by 0 <r<1,0<6< 
27, 0 yo <a. Thus, 


16 on cae i 1536 
| (40? + Sy? +2?) dvdyds = 106-24 | | | r* sin ydy dé dr = ——a 
Q 9 0 Jo Jo ) 


30. Let us first find the mass using cylindrical coordinates. As 92 is 


Y ={(r,0,t):0Sr SLO<SO<F,0<t<h, 


we have 


1 pl pr/2 9 
m= | (e+y)drdyde = [ rl | r?(cos@ + sin 0) dé dr dt = =. 
Q 0 Jo Jo 3 


By symmetry, tg = ya, and, similarly to the previous calculation, 


3 3 pl pl px/2 
[c= > | x(a + y)dxdydz = >i | | r? cos 0(cos 6 + sin 0) dO dr dt 
2 Q 2 0 JO JO 


nm /2 3 1 m/2 
= a. (cos? 6 + cos 6 sin 4) d0 dr dt = —|0 + 5 sin20 + sin? 4] 
3 T 
fei 
Ta 5) UG; 


3 3 1 1 am /2 1 
za= 5 f ole + y)drdyds = 5 | | tr?(cos@ +sin@) dé drdt = =. 
2/9 2Jo Jo Jo 2 
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car 


Figure 8.57. The meridian section relative to Exercise 31 


31. The meridian section T is shown in Fig. 8.57. Theorem 8.32 tells us 


vol(Q) = an f y dy dz. 


a 
Then 
3 y 3 
vol(2) = 2n f (/ y dz ay =2n | (y?-y yr —4) dy 
2 y2—4 2 
26(.3. le. wel a Zao. 
= onlay 3 (¥ 4) |, = 309 5/5). 
ga OF: 


33. The section T is drawn in Fig. 8.58, left. Using Theorem 8.32 the volume is 


vol({2) =2n f vdzdz=2n | (| x da) de =n f ((m — z)? — sin? z) dz 
T 0 s 0 


in z 


But as 2 is a solid of revolution around z, the centre of mass is on that axis, so 
tg = yq = 0. To find zg, we integrate cross-sections: 


1 1 . 
= —— dxdy} d 
2G vol(®) [ rae ayae =a. “(f x v) z 


where A, is the projection on xy of the annulus 92, of Fig. 8.58, right. 


8.6 Exercises 365 


Figure 8.58. Meridian section (left) and region 2, (right) relative to Exercise 33 


The integral | dx dy is the area of Az, which is known to be (a — z)? —sin? z). 


Zz 


In conclusion, 


vol({2) 


” I 
‘a= fi 2((m — z)? — sin? z) dedydz = 
0 Vi 


9 


Integral calculus on curves and surfaces 


With this chapter we conclude the study of multivariable integral calculus. In 
the first part we define integrals along curves in R™ and over surfaces in space, 
by considering first real-valued maps, then vector-valued functions. Integrating a 
vector field’s tangential component along a curve, or its normal component on a 
surface, defines line and flux integrals respectively; these are interpreted in Physics 
as the work done by a force along a path, or the flow across a membrane immersed 
in a fluid. Curvilinear integrals rely, de facto, on integrals over real intervals, 
in the same way as surface integrals are computed by integrating over domains 
in the plane. A certain attention is devoted to how integrals depend upon the 
parametrisations and orientations of the manifolds involved. 

Path and flux integrals crop up in a series of results, among which three are 
pivotal: the Divergence Theorem (also known as Gauss’ Theorem), Green’s The- 
orem and Stokes’ Theorem. They transfer to a multivariable framework the idea 
at the heart of one-dimensional integration by parts, and namely: changing the 
integral over a given domain into an integral over its boundary by modifying the 
integrand function. The aforementioned theorems have a large number of applic- 
ations, in particular differential equations governing physical laws; for instance, 
the Divergence Theorem allows to compute the variation of the flow of a unit of 
matter by means of an integral over the volume, thus giving rise to the so-called 
conservation laws. 

The last part deals with conservative fields, which are gradients of vector fields, 
and with the problem of computing the potential of a given field. That is the mul- 
tivariable version of indefinite integration, which seeks primitive maps on the real 
line. The applicative importance of conservative fields and their potentials is well 
known: in many cases a gravitational field or an electric force field is conservative, 
a fact allowing to compute the force’s work simply by difference of two values of 
the potential. 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_9, 
© Springer International Publishing Switzerland 2015 
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9.1 Integrating along curves 


Curvilinear integrals in several variables are the natural generalisation of the def- 
inite integral of a real map over a real interval. Curves were introduced in Sect. 4.6, 
while their differential aspects were discussed in Sect. 6.5. 

Let y : J = [a,b] ~ R™ (m > 2) be a regular arc, and I’ = (J) its trace. 
Suppose f : dom f C R™” —> Risa map defined on I’, at least, meaning [’ C dom f. 
Suppose further that the composite f ov : [a,b] — R, (f oy)(t) = f(7(t)), is 
(piecewise) continuous on [a, }]. 


Definition 9.1 The integral of f along + is the number 


[f- [x y(t) |Iy/(t)|| ae. 


Sometimes the name ‘curvilinear integral’ can be met. 


The right-hand side of (9.1) is well defined, because the integrand function 
f (¥(2)) |l7’(4)|| is (piecewise) continuous on [a, b]. As + is regular, in fact, its com- 
ponents’ first derivatives are continuous and so is the norm ||7y‘(¢) ||, by composition; 
moreover, f (¥(t)) is (piecewise) continuous by hypothesis. 

The geometrical meaning is the following. Let be a simple plane arc and f 
non-negative along I’; call 


G(f) = {(z,y, 2) € R®: (x,y) Edom f, z = f(x,y)} 
the graph of f. If 
= {(x,y,2) € R®: (2, y) EL, O< z < f(z,y)} 


denotes the upright surface fencing I” up to the graph of f (see Fig. 9.1), it can be 
proved that the area of »’ is given by the integral of f along +y. Say, for instance, 
f is constant equal to h on I’: then the area of = is the product of h times the 
length of I’; by Sect. 6.5.2 the length is ¢(I” = \|-v’(é) || dt, whence 


b 
area(E) = her) = f F(y@)iin'@llae= fF. 


If f is not constant, instead, we can divide the interval J into K sub-intervals 
I, = [ty-1, tx] of width At, = t, —te-1 pulnently small. Call I, = y(J;,) the 
restriction to J;,, whose length is ¢(I},) = = _ lly’ @)|] dé; then let 2, = {(x,y, 2) € 
R3 : (x,y) € Ty, O< z < f(x, y)} be the part of XY’ over I, (see again Fig. 9.1). 
Given any ti € J,, with PX = y(t;) € Ik, we will have 


area(Sa) ~ FP) = f° F(vteR))Il4' lat. 


te-1 
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dom f 


Figure 9.1. The geometrical meaning of integrating along a curve 


Summing over k and letting the intervals’ width shrink to zero yields precisely 
formula (9.1). 

Recalling Definition 9.1, we can observe that the arc length s = s(t) of (6.20) 
satisfies 


ds ; 

—_ = t 
= = ly’ 
which, in Leibniz’s notation, we may re-write as 


ds = |[7'(t)|| de. 


The differential ds is the ‘infinitesimal’ line element along the curve, corresponding 
to an ‘infinitesimal’ increment dt of the parameter t. Such considerations justify 
indicating the integral of (9.1) by 


[ fas. or [te (9.2) 


(in the latter case, the map s = s(t) is written as y = y(t), not to be confused 
with the vector notation of an arc y = 7¥(t)). 


Examples 9.2 


i) Let + : [0,1] > R? be the regular arc y(t) = (t, ¢?) parametrising the parabola 
y = x? between O = (0,0) and A = (1,1). Since ¥/(t) = (1,2t), we have 
lly’ (t) || = V1 + 4¢?. Suppose f : R x [0, +00) + R is the map f(a, y) = 34+ /y. 
The composite f o7¥ is f (y(t) = 3¢4+ Vi? = 4¢, so 


1 
[r=] Aty/1 + 4t? dt; 
Y 0) 
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this is computed by substituting r = 1 + 4¢?, 
E 
[fe3 [ wor=3f pr) = =(5V5 —1). 


ii) The regular arc ¥ : [0,1] > R°, y(t) = (42°, v21?, 4), gives 


y(t) = (#, as 1) whence |/7(¢)|| = V/t4 + 2t2+1 = (1427). 


Take now f(z, y, 2) = 3V2ryz?; then 


1 1 : 9 
P)dt= [a8 + —19] =—. 
fl f= fe a 8 ” 10 0 40 
iii) Parametrise the circle centred in (2,1) with radius 2 by + : [0,27] > R?, 
y(t) = (2+ 2cost,1+2sint). Then 


lly’ (#)|| = V4 sin? t + 4cos? t = 2, Vt. 


For the map f(z, y) = («—2)(y—1)+1 we have f(y) = 4sintcost+1, hence 


20 
ie) (4sint cost + 1) dt = 2[ — cos 2t+t]>” = 4r. 
7Y 0 


Using the curve +, with the same components of + but t ranging in [0, 2k7] (i-e., 
going around k times, instead of only once), we have 


2k 
i ) (4sint cost + 1) dt = 4kr. O 
" 0 


This last example explains that curvilinear integrals depend not only on the 
trace of the curve, but also on the chosen parametrisation. Nonetheless, congruent 
parametrisations give rise to equal integrals, as we now show. 

Let f be defined on the trace of a regular arc ¥ : [a,b] > R™ so that fo + 
is (piecewise) continuous and the integral along + exists. Then f o 6, too, where 
6 is an arc congruent to 7, will be (piecewise) continuous because composition of 
a continuous map between real intervals and the (piecewise-)continuous function 
fF OF: 


Proposition 9.3 Let 7 : [a,b] > R™ be a regular arc of trace I, and f a 
map on I’ such that f o-+ is (piecewise) continuous. Then 


[refs 


for any curve 6 congruent to +. 
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Proof. Let 6 = yoy, with » : [c,d] — [a,b], be any congruent arc to +, so 
6'(7) = ¥' (~(7)) ¢"(7). Then 


d 
ja= [+6 () ls rlar =f Foe) In) e"llar 
-[ f (y(v(7))) lly’ (e(7)) I le") dr. 


Substituting t = y(r) gives dt = y’(r) dr. Note that y(c) = a, y(d) = b if 
y’ > 0 (6 and ¥ are equivalent), or y(c) = b, y(d) = aif y’ < 0 (6 and 
are anti-equivalent). In the first case we obtain 


[i [s0 @)ivolar= fF 


whereas in the second case 


a b 
[t=-[ to@) irota= fro) roe fs. “ 


By the previous proposition the following result is straightforward. 


Corollary 9.4 The curvilinear integral of a function does not change by tak- 


ing opposite arcs. 


Given an arbitrary point ¢ in (a,b) and setting y1 = Yja,cq, Y2 = Yi[c,v}, by 


additivity we have 
| f= | raf f- (9.3) 
i) i fal "2 


This suggests how to extend the notion of integral to include piecewise-regular 
arcs. More precisely, let + : [a,b] + R™ be a piecewise-regular arc and a = ag < 
a, <... <a, = 6 points in [a,b] such that the arcs 7; = Yja,_1,a;], 7 = 1,-..,7 
are regular. Take f, as above, defined at least on I’ such that f o + is (piecewise) 
continuous on [a,b]. By definition, then, we set 


(9.4) 


Remark 9.5 Computing integrals along curves is made easier by Proposition 9.3. 
In fact, 


(9.5) 


where each 6; is an arc congruent to 7;, 1 = 1,...,n, chosen in order to simplify 
the corresponding integral on the right. Oo 
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Example 9.6 


Compute a z*, where y : [0,4] > R? is the following parametrisation of the 
square [0,1] x [0,1] 


yi(t) = (¢,0) tOseaol, 
ae ifl<t<2, 
y3(t) =(8-t,1) if2<t<3, 
“a i) = ha reseed 


y(t) = 


(see Fig. 9.2, left). Consider the parametrisations 
61(t) = 71 (¢) Vets il, 1=1; 


(()=(1t) O<t<1, b&~m, 
63(t) = (¢, 1) C<9<1, Syne, 
di(t)=(0,t) O<t<1, b4r—m% 


(see Fig. 9.2, right). Then 


1 1 1 1 5 
ay Pats f at+ [ Patt | Odt=-—. Oo 
Y 0 0 0 (0) 3 


Curvilinear integrals bear the same properties of linearity, positivity, monoton- 
icity and so on, seen for ordinary integrals. 


Remark 9.7 Let y: [a,b] > R be a regular arc. Its length ¢(7) can be written as 
(= | L. (9.6) 
Y 


The arc length s = s(t), see (6.20) with to = a, satisfies s(a) = 0 and s(b) = 
de \ly’(7)|| dr = e(y). We want to use it to integrate a map f by means of the 


A A 

Y3 63 
V4 i A ‘Y2 04 | \ 62 
O v1 1 O 01 1 


Figure 9.2. Parametrisations of the unit square relative to Example 9.6 


9.1 Integrating along curves 373 


parametrisation 7(s) = y(t(s)), equivalent to , already seen on p. 224. Then 


[a= [= [ reaeyas, 


useful to simplify integrals if we know the arc length’s analytical expression. © 


In view of subsequent applications, it makes sense to consider integrals along 
a simple and regular arc I’, thought of as a subset of R™, see Definition 6.28. 

We already know that all parametrisations of I” by simple, regular arcs are con- 
gruent (Proposition 6.27), and hence give rise to the same integral of a (piecewise-) 
continuous map f along I (Proposition 9.3). For this reason one calls integral of 
f along I’ the number 


where 7+ is any regular and simple parametrisation of I’. Equivalent symbols are 


[ tes, [te [ tee. 


The generalisation to piecewise-regular, simple arcs should be clear. In case I" is 
closed, the integral might be denoted using the symbols 


br f fas. § far, f sae. 


Remark 9.8 The integral of a (piecewise-)continuous f can be defined along a 
regular curve y : J + R’’, where IJ is a bounded, but not closed, interval. For this, 
t+ f(+(t))||y/(2)|| must be integrable (improperly as well) on J, with integral 


[s = [ fre) Iya. 


For example, parametrise the unit semi-circle centred at the origin and lying on 
y>Oby7: ee 1) — R*, y(t . (t, V1-— t?); the curve is regular only on (—1, 1), 


1 
1 1 
— ———_ dt = | arcsint = 


the right-hand-side integral being improper. 0 
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9.1.1 Centre of mass and moments of a curve 


As with thin plates and solid bodies, see Sect. 8.5, we can use curvilinear integ- 
rals to define certain physical quantities related to a thin wire resting along a 
(piecewise-)regular, simple arc I’ Cc R°. 

Let = u(P) be the wire’s linear density (mass per unit of length) at a generic 
P=2=(a,y,z) € I. The total mass will be 


or 


1 1 
nh 90 a fal zqg=— | 2B. 
m m mIJpr 


where d = d(P) is the distance of P € I from the line or point considered. The 
axial moments 


are special cases of the above, and their sum Jo = J; +1, + J, represents the wire’s 
moment about the origin. 


Example 9.9 
Let us determine the moment about the origin of the arc I’ 
e+y+z22=1, y=rz, yd, 
joining A = (1,0,0) to B = (0, v2 v2). We can parametrise the arc by y(t) = 
(V1 — 28, t,t), 0<t < 2, so that 
= J2 
lO = a 


1 — 242 vive 


and 


v2/2 J/2 J2/2 7 
Re (OP og? i de [asta or Se. 
0 / ( ) 7 — 272 [ lo 2 
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9.2 Path integrals 


Integrating vector fields along curves gives rise to the notion of path integral, 
presented below. 

Let J = [a,b] and y: I — R™ be a regular arc with trace [ = y(I). Take a 
vector field f in R™, defined on I’ at least. The composite map foy: th f (y(t) 
from I to R™ is thus defined. We shall assume the latter (piecewise) continuous, 
so that all f;(y(t)) are (piecewise) continuous from J to R. For any t € J, recall 


that '(t) 
re ee 
TO =m) = aay 


is the unit tangent vector at P(t) = y(t). The map f, = f -7, 


f-(t) = (fT) = fly) -7O), 
is the component of f along the tangent direction to at P(t). 


Definition 9.10 The path integral of f along vy is the integral along y 
of the map f;: 


[pref re 


Another name for it is line integral of f along vy. 


By definition of 7 we have 


b b 
[tre s0@) rol wlae= fF) Yee. 


a 


Thus, the integral of f along the path -y can be computed using 


(9.8) 


d 
Writing ee +'(t) as Leibniz does, or dy = -y/(t) dt, we may also use the notation 


dt 
[fe 


There is a difference between the symbol dy (a vector) and the differential dy 
of (9.2) (a scalar). 

The physical meaning of path integrals is paramount. If f models a field of 
forces applied to the trace of the curve, the path integral is the work (the work 
integral) of the force during the motion along -y. The counterpart to Proposition 9.3 
is the following. 
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Proposition 9.11 Let +: [a,b] > R™” be a regular arc of trace I’, f a vector 
field on I’ such that f oy is (piecewise) continuous. Then 


[a -T§ = | ip ee for any arc 6 equivalent to -¥, 
6 Y 


a 


nd 
| f2t5—— i, Joa for any arc 6 anti-equivalent to +. 
6 ay 


Proof. This is a consequence of Proposition 9.3, because unit tangent vectors 
satisfy T5 = T- when 6 ~ +, and Ts = —T when 6 ~ —+. 


Corollary 9.12 Swapping an arc with its opposite changes the sign of the 
path integral: 
| fomn = — f $74. 
=a o 


In Physics it means that the work done by a force changes sign if one reverses the 
orientation of the arc; once that is fixed, the work depends only on the path and 
not on the way one moves along it. 

Here is another recurring definition in the applications. 


Definition 9.13 If y is a regular, closed arc, the path integral of f is said 
circulation of f along y and denoted 


pir 


Examples 9.14 


i) The vector field f : R? > R3, f(z, y, z) = (e*, x+y, y+z) along y : [0,1] > R°, 
+(t) = (t,t?, ¢3), is given by 


flv) =(e, t+, +0) with y(t) = (1, 2¢, 327). 
Hence, the path integral of f on + is 
1 
| f-T= | (aaa a a) (1, tar de 
Y 0 


1 
19 
ah [e’ + 2(t? + 24°) + 3044 + #°)] di=e+ 7. 
0 
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ii) Take the vector field f : R? + R? given by f(x,y) = (y, x). Parametrise the 
ellipse x + ye =1 by 7: [0,27] > R?, y(t) = (3cost, 2sint). Then f(y(t)) = 
(2sint, 3cost) and y'(t) = (—3sint,2 cost). The integral of f along the ellipse 


iS ZeYo: 
Py 


27 
gpa f (2sint, 3 cost) -(—3sint, 2eost) at = 6 f (—sin* t + cos? t) dt 
Y 0 0 


20 20 
=o | (2cos?t — nat = 12 | cos? t dt — 127 = 0, 
0) 


0 
where we have used 
20 


27 9 1 1 
| cos* tdt = | =t + —sin2t = 7. oO 
0 2 4 0 


Path integrals exist on piecewise-regular arcs, too: it is enough to define, as in 
formula (9.4), 


where 7; are regular arcs constituting +. 


At last, if we think geometrically (see Definition 6.28) and take a piecewise- 
regular simple arc I’ C R”™ , the path integral along I" can be defined properly only 
after one orientation on J’ has been chosen, and the orientation depends on the 
tangent vector 7. Then we can define the path integral of f along I’ by 


(9.10) 


where ¥ is any simple, (piecewise-)regular parametrisation of I’ with the chosen 
orientation. Clearly, reversing the orientation has the effect of changing the integ- 
ral’s sign. The circulation of f along a piecewise-regular Jordan arc (closed and 
simple) will be indicated by 
¢ 7% 
‘s 


A different notation for the path integral, based on the language of differential 
forms, is provided in Appendix A.2.3, p. 529. 
9.3 Integrals over surfaces 


In perfect analogy to curves, the integral on a surface of a map in three variables is a 
natural way to extend double integrals over flat regions. This section is dedicated 
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to integrating over surfaces. To review surfaces, see Sect. 4.7, whilst for their 
differential calculus see Sect. 6.7. 

Let R be a compact, measurable region in R?, o : R > R° a compact regular 
surface with trace 3’. The normal vector v = v(u,v) at P = o(u,v) was defined 
in (6.48). Take a map f : dom f C R® — R defined on » at least, and assume 
f og is generically continuous on R. 


Definition 9.15 The integral of f over the surface o is the number 


[t= [ femme ae (9.11) 


As the surface is regular, the function (u,v) +> ||v(u,v)|| is continuous on R, so 
the right-hand-side integrand is generically continuous, hence integrable. 

The definition is inspired by the following considerations. Suppose FR is a rect- 
angle with sides parallel to the (u,v)-axes, and let us divide it into rectangles 
Rak = [Un, Un + Au] x [v~, vz + Av] with lengths Au, Av, such that the interiors 
O° 


Rnk are pairwise disjoint (as in Fig. 9.3, left). The image of Raz is the subset 
Ink Of 7 bounded by the coordinate lines through o(up,vz), o(un + Au, vz), 
a(Un, Vk + Av) and o(up + Au,v, + Av) (Fig. 9.3, right). If Au and Av are 
small enough, the surface is well approximated by the parallelogram IJ; lying on 
the surface’s tangent plane at o(u;,,v%), which is spanned by the tangent vectors 


2 (un, vz) Au and 2 (un, ux) Av (Fig. 9.4). The area Ao of nx, is 


Ao = [Few A OO ay i AuAv = ||v (up, vg) || Aud , 


Ou Ov 


a number that approximates the area of 3’,;. Therefore, we may consider the term 


-c 


vp + Av + 


Uk 


Figure 9.3. Partitions of the domain (left) and of the trace (right) of a compact surface 
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Figure 9.4. Element »',, and corresponding tangent plane Ip, 


do = ||v(u, v)|| dud 


in the right-hand side of (9.11) as an ‘infinitesimal’ surface element on . In 
particular, if the surface is simple, we shall see that the area of »’ is precisely 


[i= fe oiauae, 


Let us suppose the regular compact surface 0 : R — R° is given by o(u,v) = 
ut + vj + p(u,v)k. Recalling (6.49), we have 


|v(u, v)|| = 1+ (2) + (22) 


and so the integral of f = f(x,y, z) on o is 


[t= [ fuvetus) 1+ (%) iS. dudv. 


For example, if o is defined by y(u,v) = uv over the unit square R = [0,1]?, 


and f(x,y, 2) = 2//14+2?+4+y?, then 
fr-f [ 1 ty? 2 dud 
o o Jo Vi+u?+v? 


1 pl l 
-| i uvdudu= —. Oo 
0 Jo 4 


Surface integrals are invariant under congruent parametrisations, as stated by 
the next proposition. 


Example 9.16 
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Proposition 9.17 Let o : R — R?® be a compact surface with trace ©) and 
f a map on 3’ such that foo is generically continuous on R. Then for any 
parametrisation o :R — 5? congruent to oa, 


[=f 


Proof. Call ®: R — R the change of variables such that ¢ = 0 0 ®. By (6.50) 
and Theorem 8.24, 


[t= [ Fe@a) oan) ava 
_ L fle@e. ¥))) || det JH(U, d)v (Bu, d)) || dwdv 
= [ {o@@.2)|uo(@.0) Ider 78, avae 


= f Fo(u.»))|o(u,r)|duaw= ff. 


Oo 


The above result allows us to define integrals over regular, simple, compact 
surfaces ©’ C R? thought of as geometrical surfaces (see Definition 6.34). To be 
precise, one calls integral of f on the surface »’ the quantity 


(9.12) 


where @ is any regular, simple parametrisation of >’. Since all such o are congruent, 
the definition makes sense. Alternatively, one may also write 


[ flovae or [ feo. 


Integrals can be defined over piecewise-regular compact surfaces 4’, meaning 
the union of n regular, simple compact surfaces 37),..., 5’, as of Definition 6.43. 
In such a case, one declares 


Remark 9.18 The definition extends to cover non-compact surfaces, as we did 
for curvilinear integrals (Remark 9.8), with the proviso that the right-hand-side 
map of (9.11) be integrable on R. 
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For instance, the unit hemisphere »’ defined by 
ge? ey? 2? =, 220, 
is not compact in Cartesian coordinates, for 
o(u,v) = (u,v, V1 — u? — v?) 


cannot be prolonged differentiably to an arbitrary open set containing the unit 
disc D. Nonetheless, we may still use (9.9) to find the surface’s area, because the 


map 
1 


y(u,v)|| = 
|Y(u,»)| =§ =—— 


2 


is integrable on D. In fact, 


Qn 1 
: 
il V(u,v dude = | | Serer. 
[ (u,v) J) a : 


9.3.1 Area of a surface 


Via surface integrals we can define the area of a compact surface »’ (piecewise 
regular and simple) thoroughly, by 


Example 9.19 


i) The example of Remark 9.18 adapts to show that the area of the hemisphere 
»} of radius r is 


area( 5’) = | 1 =2nr?, 
x 


as elementary geometry tells us. 


ii) Let us compute the lateral surface ¥’ of a cylinder of radius r and height L. 
Supposing the cylinder’s axis is the segment |0, L] along the z-axis, the surface 37 
is parametrised by o(u,v) =rcosui+rsinuj+vk, (u,v) € R = [0, 27] x (0, L]. 
Easily then, 

v(u,v) =rcosut+rsinuj+0k, 


whence ||v(u, v)|| = 7. In conclusion, 


20 L 
area( 5’) = | | rdudv = 2arL, 
0 0 


another old acquaintance of elementary geometry’s fame. Oo 
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It is no coincidence that the area is found by integrating along a meridian arc. 


The cylinder is in fact a surface of revolution (Example 4.37 iii)): 


Proposition 9.20 Let »’ define the surface of revolution generated by re- 
volving the arc I’, on the plane xz, around the z-axis. Then 


area()) = an | ae. 


it 


Proof. Retaining the notation of Example 4.37 iii) we easily obtain 


v(u,v) = —71(u)94(u) cos vé + 91 (u)7h(u) sin v j 
+(41(u)7} (u) cos? v + y1(u)y4(w) sin? v) ke 


whence 
e(u, wll = 2%) (1 (u))? + 93 (%))?) = n@lly’()IL. 


Above we assumed x = 7;(u) non-negative along the curve. Therefore 


area(S) = [ fe / 7 / a1(u) I" (u) || dude 


=2n | au(u)lly’(w)|ldude = 20 fa, 


As 27a is the length of the circle described by P = (x,0,z) € I’ during the 
revolution around the axis, and 


Ire _ Ire 


oo ea = 2G) 


is the coordinate of the centre of mass G of I’, corresponding to unit density along 
the curve, we can state the formula as 


areal) i= ane Cf), 
This is known as Guldin’s Theorem. 


Theorem 9.21 The area of a surface of revolution 3’ is the product of the 


length of the meridian section times the length of the circle described by the 
arc’s centre of mass. 
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9.3.2 Centre of mass and moments of a surface 
Imagine a thin shell covering a (piecewise-)regular, simple, compact surface 37 C 


R°. Denoting by 4 = p(P) the shell’s density of mass (mass per unit of area) at 
P € 3’, we can compute the total mass 


m= fu 
>») 


and the centre of mass G = (xa, ya, 2a) 


sO 


1 1 1 
ca=— | an, ve= = fom, Z24q=— |] Zp. 
ms m JS mJy 


where d = d(P) is the distance of the generic P € »’ from the line or point given. 
The moments about the coordinate axes 


are special cases; their sum Jp = I; + ly + I, represents the moment about the 
origin. 


9.4 Flux integrals 


After learning how to integrate scalar fields on surfaces, we now turn to vector- 
valued maps and define the fundamental notion of flux integral, which will occupy 
this section. 

Let f be a vector field on R® that is defined (at least) on the trace ©’ of a 
regular, compact surface 0 : R > XY, with R compact and measurable in R?. We 
assume the composite f oo is generically continuous on R. If n = n(u,v) is the 
unit normal to »’, one calls normal component of f the component f, = f-n 
along n. 
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Definition 9.22 The surface integral of f on o is the integral on the 
surface o of the map fr: 


[on [ 


Because of Definition 6.35 we can compute surface integrals by 


[t-m= f flo(ur)) -v(uryduav. (9.13) 


As already seen for path integrals, congruent parametrisations in surface integrals 
possibly entail a sign ambiguity. 


Proposition 9.23 Let o : R — R® be a regular compact surface with trace 
}, f a vector field on X’ such that f oo is generically continuous. Then 


i f:n= / f-1, for any compact surface o equivalent to o, 
o o 
and 


f:n=- i fin, for any compact surface o anti-equivalent toa. 
o 


o 


Proof. The proof descends from Proposition 9.17 by observing that unit normals 
are equal, n = n, if o& is equivalent to o, and opposite, n = —n, if o& is 
anti-equivalent to a. 


The proposition allows us to define the surface integral of a vector field f 
over a (piecewise-)regular, simple, compact surface ©’ seen as embedded in R°. In 
the light of Sect. 6.7.2, it is though necessary to consider orientable surfaces only 
(Definition 6.38). 

Let us thus assume that » is orientable, and that we have fixed an orientation 
on it, corresponding to one of the unit normals, henceforth denoted n. Now we 
are in the position to define the flux integral of f on »' as 


(9.14) 


where o is any simple and (piecewise-)regular parametrisation of »’ inducing 
the chosen orientation. This integral is often called simply flux of f across, or 
through, »’. The terminology stems from Physics; suppose the surface is immersed 
in a fluid of density w = p(a), and v = v(a) is the velocity of the point-particle 
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at P= a. Set f = pv and denote by A» the element of area Ao and normal n. 
Then (f -n) Ao is the volume rate of fluid flow through A’ per unit of time, i.e., 
the discharge. Summing, and passing to the limit, the flux integral of f across »’ 
is the difference between the overall outflow and inflow through the surface »’. 

When the surface »’ is bounded and encloses an open domain {2, one speaks 
about outgoing flux or ingoing flux according to whether the normal n leaves 
2 or enters 92 respectively. 


Example 9.24 


Let us determine the outgoing flux of f(a) = yt — xj + zk through the sphere 
centred in the origin and of radius r. We opt for spherical coordinates and para- 
metrise by o : [0,7] x [0,27] > R®, o(u,v) = rsinucosvi+rsinusinuvg + 
rcosuk (see Example 4.37 iv)). The outgoing normal is 

v(x“) =rsinux =rsinu(zi+ yj + zk), 
see Example 6.36 ii). Therefore (f - v)(@) = rsinuz? = r°sinucos?u, and 
recalling (9.13), we have 


20 T 
4 
[rn-| [pr sinucos? ududu = Sar’ 
oH 0 JO 3 


For the flux integral, too, we may use an alternative notation based on the 
language of differential forms; see Appendix A.2.3, p. 529. 


9.5 The Theorems of Gauss, Green, and Stokes 


The three theorems of the title should be considered multi-dimensional versions 
of the formula of integration by parts. Each one of them allows to transform the 
integral of an expression involving the differential operators of Sect. 6.3.1 on a two- 
or three-dimensional domain, or a surface, into the integral over the boundary of 
an expression without derivatives. 

The importance of such results is paramount, both from the theoretical point 
of view and in relationship to applications. They typically manifest themselves, 
for example, when one formulates a law of Physics (e.g., the conservation of mass 
or energy) in mathematical language (a PDE); but they may also play a role in 
determining the conditions that guarantee the solvability of said equations (exist- 
ence and uniqueness of solutions); at last, several numerical techniques for solving 
equations (such as the finite-volume method and the finite-element method) are 
implemented by using one of the theorems. 

At a more immediate level, these results enable to simplify an integral, and 
crop up when examining special vector fields, like conservatives fields. 

We start by discussing a class of open sets and surfaces, that we will call 
admissible, for which the theorems hold. A first, basic study of Sects. 9.5.2 - 9.5.4 
does not require such level of detail. 
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Figure 9.5. Unit tangent to a portion of the Jordan arc I’, and unit normal rotated 
by 7/2 


9.5.1 Open sets, admissible surfaces and boundaries 


Open sets in the plane. Let I C R? be a piecewise-regular Jordan arc with a 
given orientation. The unit tangent vector t = t12+1t2j exists at all points P of I, 
with the exception of a finite number. At P the normal direction v to I’ (orthogonal 
to t) is thus well defined. In particular, the unit vector n = nyi+noj = tgi—t17 to 
I at P is obtained rotating t clockwise by 7/2 (Fig. 9.5); otherwise said, identifying 
n and t with n + 0k and t+ 0k in R? makes the triple (n, t, k) right-handed, for 
(nAt)-k = 1. (Notice that the unit vector n might not coincide with the principal 
normal of (6.22), whose orientation varies with the curve’s convexity; at any rate 
the two vectors clearly differ by a sign, at most.) Furthermore, I" separates the 
region inside I’ from the external region, by virtue of Jordan’s Curve Theorem 4.33. 


Definition 9.25 We call a bounded open set 22 C R? G-admissible if the 
following hold: 


i) the boundary OQ is a finite union of piecewise-regular, pairwise-disjoint 
Jordan ares Tn. an 


ti) QQ is entirely contained either inside, or outside, each Ty. 


Each point P € 02 will belong to one, and one only, Jordan arc I}, so there 
will be (save for a finite number of points) a unit normal n to Ij, at P, that 
is chosen to point outwards 2 (precisely, all points Q = P + en, with « > 0 
sufficiently small, lie outside 2). We will say n is the outgoing unit normal to 
O02, or for short, the outgoing normal of 012. 

The choice of the outward-pointing orientation induces an orientation on the 
boundary O2. In fact, on every arc I}, we will fix the orientation so that, if denotes 
the tangent vector, the frame (n,t, k) is oriented positively. Intuitively, one could 
say that a three-dimensional observer standing as k and walking along 
Ij, will see 2 on his left (Fig. 9.6). We shall call this the positive orientation 
of 092 (and the opposite one negative). 
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Figure 9.6. A G-admissible open set in the plane 


Open sets in space. The whole discussion on two-dimensional G-admissible open 
sets extends easily to space. For this, we recall that every closed and orientable 
surface divides space in a region enclosed by the surface and one outside it (The- 
orem 6.42). 


Definition 9.26 We call a bounded open set 2Q C R° G-admissible ;f: 


i) its boundary O22 is the union of a finite number of pairwise-disjoint sur- 
faces ie e009 Den 


ii) each Xi, is piecewise regular, simple, orientable and closed; 


iii) Q lies entirely inside or outside every surface S'g. 


For a given G-admissible open set there is a well-defined outgoing normal n 
to 092, which will coincide with the normal to the surface 7, oriented from the 
inside towards the outside of 2 (Fig. 9.7). 

Here are some examples of G-admissible open sets. 


Examples 9.27 
i) The inside of the elementary solids (e.g., parallelepipeds, polyhedra, cylinders, 
cones, spheres), and of any regular deformation of these, are G-admissible. 


ii) We say an open bounded set 2 C R? is regular and normal for z in case 
§ is normal for z as in Definition 8.27, 


Q= { (x,y, 2) = R° : (x,y) € De (x, y) <L2< Bea} 
where D is open in R? with boundary 0D a piecewise-regular Jordan arc, and 
a, are C! maps on D. Such an open set is G-admissible. 
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Figure 9.7. A G-admissible open set in R® (with K = 1) 


The boundary of 2 consists of a single surface %;, piecewise regular, normal, 
orientable and closed; in fact, we can decompose X' = X'g U Yq U Xe, where 


Xp = {(2,y,8 (2, y)) € R*: («,y) € D}, 
a { (a, Y,Q (x,y) ER®: (x,y) € D}, 


de = (2 y, 2) € R? : (x,y) = OD,a ey) S25 B(x,y)} : 
The outgoing unit normal ng to %’g is obtained by normalising 


vale) =~ Soau)i— (ea +k, 


+3 being a local graph (see Example 6.36, i)). 
Similarly, the outgoing unit normal ng to X’g is the unit vector corresponding 
to 


O O 
(x,y) = 5 (a, yyti+ By tr) —k. 
The outgoing normal ny to Xp is 
n=Mnap +Ok, 


where nap is the outgoing unit normal (in two dimensions) of 0D. 
Regular and normal sets for x or y are defined in the same way. 


iii) Let us see how to generalise the above situation. The open sets 91,..., 2K 
form a partition of an open set 2 if the 2; are pairwise disjoint and the union 


of their closures coincides with the closure of 2: 
K 


2=(JQ. with MNQ=0 ifhFk. 

k=1 
We then say an open bounded set 2 of R° is piecewise regular and normal 
for x; (¢ = 1, 2,3) if it admits a partition into open, regular, normal sets for x; 
(see Fig. 9.8 for the two-dimensional picture). Such 2 is G-admissible. a 
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Figure 9.8. Partition of Q C R? into the union of normal sets for y 


Compact surfaces. Henceforth » will denote a piecewise-regular, normal and 
orientable compact surface as of Sect. 6.7.4; let X%, k = 1,...,K, be its faces, 
each of which is normal, regular and compact. 

The notion of S-admissible compact surfaces is relevant in view of Stokes’ 
Theorem. 


Definition 9.28 We call S-admissible a compact, piecewise-regular, nor- 
mal, orientable surface X’ whose faces X,, k =1,...,K, can be parametrised 


by maps on: Rr > Ye where Ry = Q, is the closure of a G-admissible open 
set (2, in the plane. 


Given such an S-admissible, compact surface 4’, we will assume to have fixed 
one orientation by choosing a normal n to »’. On each face X’,, n coincides with 
one of the unit normals to X},, say nz. Without loss of generality we may suppose 
mn; is the unit normal associated to the parametrisation o%, see Definition 6.35. 
(If not, it is enough to swap o, with o, on Rez = {(u,v) € R? : (—u,v) € Re} 
given by ox(u,v) = (—u,v), whose unit normal is opposite to that of o%.) 

By compactness, the unit normal n is defined right up to the boundary of »’ 
(with the exception of finitely many points, at most). For this reason we can choose 
an orientation on 02’. Roughly speaking, the positive orientation is given by 
the walking direction of an ideal observer standing as n that proceeds 
along the boundary and keeps the surface at his left. 

A more accurate definition requires a little extra work. Every point P belonging 
to OS’, except for a finite number, lies on the boundary of exactly one face X, 
(Fig. 9.9); moreover, around P the boundary 0%; is a regular arc I, in R3, given by 
the image under o, of a regular arc A; in R? contained in the boundary of a region 
R,. In other terms, Ay = {,(t) : t € I,}, where yy, is the parametrisation of the 
Jordan arc containing A;, and correspondingly Ij, = {nx(t) = ox (Ye (t)) :te I}; 
the point P € I}, will be image under o, of a point po € Ax identified by t = to, 
hence P = o¢(po) = ox(Ye(to)) = mx(to). To the (column) vector tT = 7;,(to), 
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Figure 9.9. The positive orientation of a compact surface’s boundary 


tangent to A, at po, corresponds the (column) vector 


ills 


ag (to) = Jon (po)¥z(to) 


(recall the chain rule), tangent to I, at P. Let t be the corresponding unit vector. 
Then, if we assume that yz, induces the positive orientation on the Jordan arc 
where A; lies, we will say the orientation of the arc J} C 0’ induced by 
the unit vector t is positive. We can say the same in the following way. Let IT 
be the tangent plane to »’ at P that contains t, and denote by g the unit vector 
orthogonal to t, lying on JJ and pointing outside 4’; then the positive orientation 
of 0» is the one rendering (g, t, 2) a positively-oriented triple (see again Fig. 9.9). 


Example 9.29 

Suppose »’ is given by 

Z = {(x,y,z) € R*: (2,y) €R,z = (z,y)}, (9.15) 
where FR is a closed, bounded region of the plane, the boundary OF is a regular 
Jordan arc I’ parametrised by 7 : I + I’, and y is aC! map on the open set A 
containing 7; let (2 indicate the region inside I’. 
The surface »’ is thus compact, S-admissible and parametrised by a: R > D, 
a(x,y) = ‘eo WD (E. y)). The corresponding unit normal is 

n= Tw : where V=—Yzt— pyj tk. (9.16) 

Vv 

If y parametrises I” with positive orientation (counter-clockwise), also the bound- 
ary 


AL = {n(t) = (n1(t), v2(t), e(ri(), 22) st € TF 
will be positively oriented, that is to say, »' lies constantly on its left. 
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The corresponding unit tangent to 0X’ is t= an where 


ny (t) = (V1.4), 19(8), Ge (V(E)) 14 (1) + Gy (¥(4)-9(E)) - (9.17) 


9.5.2 Divergence Theorem 


The theorem in question asserts that under suitable hypotheses the integral of a 
field’s divergence over an open bounded set of R” is the flux integral of the field 
across the boundary: letting (2 denote the open set and n the outward normal to 
the boundary O02, we have 


; div f = fin. (9.18) 
Q an 

We held back on purpose from making precise hypotheses on (2 and f, as 
multiple possibilities exist. The theorem’s proofs may be sensibly simplified by 
sufficiently restrictive assumptions, at the cost of diminishing its far-reaching im- 
pact. Finding ‘minimal’ hypotheses for its validity is a task beyond the scope of 
our study. 

In the sequel the Divergence Theorem will be stated under the assumption that 
the domain {2 be G-admissible, as discussed in Sect. 9.5.1; far from being the most 
general, our statement will hold nonetheless in the majority of cases of interest. 
The reader that wishes to skip the details might think of a G-admissible set as 
an open bounded set whose boundary is made of finitely many graphs of regular 
maps, locally viewing the open set on the same side. The outward normal of the 
open set will be the outward normal of each graph. 

Let us begin in dimension three, by the following preparatory, but relevant 
irrespectively, result. The proof is available in Appendix A.2.2, p. 524, under more 
stringent, but still significant, assumptions on the domain. Hereafter, Cartesian 
coordinates will be equivalently denoted by x1, x2, x3 or by a, y, z. 


Proposition 9.30 Let the open set Q C R® be G-admissible, and assume 
= O = 
f €C°(Q) with ae EC (Q) 7e 11,23 Then 


i‘ a dedyde = [ fn; do, 
Q Ox; 0Q 


where n; is the ith component of the outward normal to O22. 


Proposition 9.30 is the most straightforward multi-dimensional generalisation 
of the recipe for integrating by parts on a bounded real interval, as we mentioned 
in the chapter’s introduction. Its importance is cardinal, because the Divergence 
Theorems and Green’s Theorem descend easily from it. Let us see the first of these 
consequences. 
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Theorem 9.31 (Divergence Theorem of Gauss) Let 2 Cc R® be a G- 
admissible open set, n the outward normal to OQ. For any vector field f € 


Eo 


[ av facayae = f f-ndo. (9.19) 
Q aa 


Proof. Each component f; of f fulfills the assumptions of the previous proposition, 


sO 
Ofi , 
| SH drdyde = | fing do for 2 = 1,3,5. 
Q Ox; AQ 
Summing over 7 proves the claim. oO 


There is actually a version of the Divergence Theorem in every dimension 
n > 2. We will only show how the two-dimensional form below is a consequence 
of the three-dimensional one; for this we shall use a trick. 


Theorem 9.32 Let the open set QC R? be G-admissible and the normal n 
to 02 point outwards. Then for any vector field f € Clone 


[ aw facay= | f-ndy. (9.20) 
Q aQ 


Proof. Define the open set Q = 2 x (0,1) C R%, which is G-admissible (see 


Fig. 9.10); let @ = f+0ke (c1())°, a vector field constant with respect 
to z for which div ® = div f. Then 


1 
[ av facay = | | div facayde = f div®ardy ae. 
Q 0 J Q 


On the other hand, if N is the outward normal to OQ, it is easily seen that 
f-n=@®-N on OM x (0,1), whereas ®- N =0 on 2 x {0} and 2 x {1}. 


Figure 9.10. From the two-dimensional (2 to the three-dimensional Q 
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Therefore 


1 
| fongy= f/f fendy= | Nao = | ®-Ndo. 
0 0 JON O22x (0,1) oq 


The result now follows by applying the Divergence Theorem to the field 
® on Q. Oo 


Example 9.33 


Let us compute the flux of f(x, y, z) = 27 — 57 + 3k through the lateral surface 
»} of the solid 2 defined by 


a? ty? <9-2z, 0O< 2 <8. 
First, 02 = ’}UBoU Bi, where Bp is the circle of centre the origin and radius 3 


on the plane z = 0, while B, is the circle with radius 1 and centre in the origin 
of the plane z = 8. Since div f = 0, the Divergence Theorem implies 


0= | aivfardyaz= | fin= | fin+ f:n+ f-n. 
Q an x Bo By 
But as 


fen= | (—3) = —27a and fen= | a= ar, 
Bo By Bi 


we conclude that 
| f-n=24r. Oo 
x 


9.5.3 Green’s Theorem 


Other important facts ensue from Proposition 9.30. Let 2.C R° be a G-admissible 
open set and f aC! vector field on 2. As we know, the first component of the curl 


of f is 
Of3 Of 
] = = 
(curl f); Dy Ae 
Therefore integrating over (2 and using the proposition repeatedly, we obtain 
O O 
| (curl f); dxdydz = | es — ee) dx dydz = | (nA f)ido. 
Q Q > Oy Oz dQ 


Identities of this kind hold for the other components of the curl of f. Altogether 
we then have the following result, that we might call Curl Theorem. 


Theorem 9.34 Let Q C R® be open, G-admissible, and n the outward nor- 
mal to OQ. Then, for any vector field f € (C*(Q)) , we have 


[ curt favayae = | nA fdo. (210) 
Q AQ 
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Figure 9.11. Outward normal vector and positive orientation of 02 


If we write the curl as V A f, the formula can be remembered as follows: the 
integral over {2 becomes an integral on Of? and the vector n takes the place of V. 


As we show in the online material, Green’s Theorem should be considered a 
two-dimensional version of the above, and can be easily deduced with the trick 
used for Theorem 9.32. 

One last remark is in order in the run up to Green’s Theorem. Let 2 C R? be 
open and G-admissible. If n = n,t+ nj is the outward normal to 092, the unit 
vector t = —n2gt+ nj. is tangent to O2 and oriented along the positive direction 
of the boundary: a three-dimensional observer standing as k and walking along 
OQ will see 2 constantly on his left (Fig. 9.11). Now recall that given a field 
f=fiit fog € (c1(Q))’, we defined in (6.5) the function curl f = st _ ae 

Finally everything is in place for the statement, whose proof may be found in 
Appendix A.2.2, p. 525. 


Theorem 9.35 (Green) Let 2 C R? be a G-admissible open set whose 
boundary OP) is positively oriented. Take a vector field f = fit + fog in 
(c1(Q))*. Then 


LG-Foew= 9 5-7. (9.22) 


The theorem can be successfully employed to reduce the computation of the 
area of a domain in the plane to a path integral. Fix 2 C R? open and G-admissible 
as in the theorem. Then 


(9.23) 
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In fact, f(x, y) = —yi+ 2x7 has constant curl f = 2, so Green’s Theorem gives 


<7 = 2 | daz dy, 
an Q 
whence (9.23). 


But notice that any vector field with constant curl on {2 may be used to obtain 
other expressions for the area of (2, like 


area(2) = ¢ (-y)t- T= ¢ £9+T. 
an an 
Example 9.36 


Let us determine the area of the elliptical region E = {(x,y) € R?: . + ye <1). 
We may parametrise the boundary by y(t) = acosti+bsint J, t € [0,27]. Then 


1 27 1 27 
area(E) = 5 / (absin t + abcos* t) dt = sab f di = mab: 
0 0 


9.5.4 Stokes’ Theorem 


We discuss Stokes’ Theorem for a rather large class of surfaces, that is S-admissible 
compact surfaces, introduced with Definition 9.28. Eschewing the formal definition, 
the reader may think of an S-admissible compact surface as the union of finitely 
many regular local graphs forming an orientable and simple compact surface. With 
a given crossing direction fixed, we shall say the boundary is oriented positively if 
an observer standing as the normal and advancing along the boundary, views the 
surface on the left. 

First of all we re-phrase Green’s Theorem in an equivalent way, the advantage 
being to understand it now as a special case of Stokes’ Theorem. We can identify 
the closure of 2, in R?, with the compact surface Y = Qx {0} in R® (see Fig. 9.12); 
the latter admits the trivial parametrisation 0 : 22 > 3’, o(u,v) = (u,v,0), and 
is obviously regular, simple and orientable, hence S-admissible. The boundary is 
OS) = 02x {0}. Fix as crossing direction of 4’ the one given by the z-axis: by calling 
n the unit normal, we have n = k. Furthermore, the positive orientation on 02 
coincides patently with the positive orientation of 0X’. The last piece of notation 
is the vector field ® = f + 0k (constant in z), for which curl f = (curl® )3 = 
(curl® ) -n; Equation (9.22) then becomes 


[ (curt) n= ®-7, 
zr as 


which — as we shall see — is precisely what Stokes’ Theorem claims. 

We are then ready to state Stokes’ Theorem in full generality; the proof is 
available in Appendix A.2.2, p. 526, in the case the faces are sufficiently regular 
local graphs. 
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Os) 


Figure 9.12. Green’s Theorem as a special case of Stokes’ Theorem 


Theorem 9.37 (Stokes) Let © C R® be an S-admissible compact surface 
oriented by the unit normal n; correspondingly, let the boundary O' be ori- 
ented positively. Suppose the vector field f, defined on an open set A C R® 


containing 3’, is such that f € Ge Then 


i (eurk fa = i iese (9.24) 
yy ay 


In other words, the flux of the curl of f across the surface equals the path 
integral of f along the surface’s (closed) boundary. 


Example 9.38 
We use Stokes’ Theorem to tackle Example 9.33 in an alternative way. The idea 
is to write f = 22-57 + 3k as f = curl®. Since the components of f are 
constant, it is natural to look for a ® of the form 
@(x,y,2z) = (ait + agy + agz)i + (Bix + Boy + B3z)9F + (Mie + yay + 732)k, 
Se) 
curl® = (42 — §3)¢ + (a3 — y1)9 + (G1 — a2)k. 

Then 2 — 83 = 2, ag —y1 = —5, 61 — ag = 3. A solution is then ®(z,y, z) = 
—5zt+ 327 + 2yk. (Notice that the existence of a field ® such that f = curl® 
is warranted by Sect. 6.3.1, for div f = 0 and 2 is convex.) By Stokes’ Theorem, 


[tne | cue n= | ®-.T. 
Z Ss ay 


We know 02) = 0By U OB), see Example 9.33, where the circle 0Bpo is oriented 
clockwise, while 0B, counter-clockwise. 
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Then 
27 
| ®-7T= / (0i + 9cost7 + 6sintk) - (3costi+3sint 7 + 0k) dt 
Bo 0 


27 
=27 | cos’ t dt = 277, 
0 
27 
[er--f (—407 + 3costj + 2sintk) - (costa+sintj + Ok) dt 
By 0 


20 20 
= 40 | costat—3 | cos? tdt = —37, 
0 0 


so eventually 


i f-n=247. 
ay 


9.6 Conservative fields and potentials 


In Sect. 6.3.1, Definition 6.10, we introduced the notion of a conservative field on 
an open set (2 of R” asa field f that is the gradient of a map y, called the potential 


of f 
f =grady, on 2. 


Path integrals of conservative fields enjoy very special properties. The first one 
we encounter is in a certain sense the generalisation to curves of the Fundamental 
Theorem of Integral Calculus (see in particular Vol. I, Cor. 9.39). 


Proposition 9.39 Jf f = grady is a conservative and continuous field on 
{IR hen 


| f-7 =9(7(0)) — 9(4(a)) 


for any (piecewise-) regular arc y : [a,b] > 22. 


Proof. It suffices to consider a regular curve. Recalling formula (9.8) for path 
integrals, and the chain rule (esp. (6.13)), we have 


(grad y)(1(t)) -'(¢) = Lo(rt), 


so 
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Corollary 9.40 Under the hypotheses of the previous proposition, let IX C 
IR” be a (piecewise-)regular, simple arc oriented by the tangent vector rT. Then 


[ f= 9) - 9), 


where Po and P, are the initial and end points of I’. 


An elementary but remarkable use of this corollary is that the potential of a 
conservative field is defined up to a constant on each connected component of 2. 
Hence potentials behave somehow similarly to primitives in R, namely: 


Proposition 9.41 Two scalar fields y, w are potentials of the same continu- 
ous vector field f on 2 if and only if on every connected component 02; of Q2 


there is a constant c; such that p — Ww = G. 


Proof. It is clear that if y and w differ by a constant on 22;, then Vy = Vw. 
For the converse, fix Po arbitrarily in 2;; given any P € 92;, let I bea 
polygonal path starting at Po and ending at P (such will exist because 9; 
is connected, see Definition 4.13). Then the corollary guarantees 


[ fir =o(P) = o(Ps) = v(P) = o(P), 


from which y(P) — v(P) = v(Po) — ¥(Po) = G. Oo 


Proposition 9.39 and Corollary 9.40 tell us the path integral of a conservat- 
ive field depends only on the end points and not on the path itself. Equivalently, 
arcs joining the same two points give rise to equal path integrals. In particular, 
the integral along a closed arc is zero. That each of these two facts character- 
ise conservative fields is of primary importance. To establish this, we need some 
notation, also useful for later. If y~ : [a,b] > R” is an arc between Py = (a) 
and P; = y(b), we shall write y[Po, Pi] to mean that y goes from Po to P;. The 
opposite arc —+y (see Definition 6.25) joins P, to Po, i.e., —y|Pi, Po] = y|Po, Pil. 
Given y1 = 71[Po, Pi] and y2 = -y2[Pi, Po], by y ~ 1 + 2 we will denote any 
arc -y with the following property: if y : [a,b] — R”, there is a c € (a,b) with 
Vitae} ~ YW and Y>c,o) ~ Ya. (An example can be easily found using increas- 
ing linear maps from [a,c], [c,b] to the domains of 71, -y2 respectively.) Observe 
4y(c) = P; and y = 7¥[Po, P2], so y connects Pp to P2 passing through P;; moreover, 
the traces [\I’, and I satisfy 7 = I, UIy. The symbol y ~ 1 — 72 will stand 
for y ~ ¥1 + (—72) whenever 71 = 7¥1[Po, Pi] and y2 = ‘y2[P2, Pi]. By the ad- 
ditivity of curvilinear integrals, and recalling how they depend upon congruent 
parametrisations (9.5), we have 


[ter fore) f°T if y~yt7e.- (9.25) 
*Y 71 72 
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Now we are in a position to state the result. 


Theorem 9.42 Jf f is a continuous vector field on the open set 2 CR”, the 
following are equivalent: 


i) f is conservative; 


ii) for any (piecewise-)regular arcs 1, Y2 with trace in Q and common end 
points, 


[fr eee 


iii) for any (piecewise-)regular, closed arc y with trace in 22, 


ff-r=0. 


Proof. The implication i) = 7) follows easily from Proposition 9.39 on 7, Y2 
(Fig. 9.13, left). For the converse, we manufacture an explicit potential for 
f. Let 2; denote a connected component of 2 and Po € 92; a given point. 
For any P € 92; of coordinates x = (x1,...,2n), set 


v(a)= f f- 


where yy = ¥[Po, P| is an arbitrary (piecewise-)regular arc with trace inside 
§2;, joining Po and P. The definition of y(a) does not depend on the choice 
of +, by ii). We claim that grad y = f, and will prove it only for the first 
component 


Let Az; # O be an increment such that P+ AP = «+ Arzje; still 
belongs to 2;, and call y|P, P + AP] the curve >(t) = x + te; from P to 


4[P, P+ AP] = (a#+tAz)er 
Pf -P+AP 
v1 P; 
WLP, PI 4(Po, P + AP] 


Po 


Po 


Figure 9.13. Proof of Theorem 9.42 
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P+ AP, with t € [0, Am] if Av; > 0 and t € [Azj,0] if Az, < 0. Let 
y|Po, P| and y|Po,P + AP] be regular, simple curves from Pp to P and 
P+ AP respectively (Fig. 9.13, right). Then +[Po,P + AP] ~ y[Po, P|] + 
4|P,P + AP], and by (9.25) and (9.8), we have 


=e 1 fir-f fir 
Ax, Faia 7[Po,P+AP] [PoP] 


oot 


= f:-T=— fila BF a Udy aa ag My dt. 
Az; a Ax, Jo = 


The last term is the integral average of the map t+> fi(v1 +t, %2,...,2n) 
on the interval between 0 and Az, so the continuity of f and the Mean 
Value Theorem (Vol. I, Thm. 9.35) force 


pla + Aries) — 9(x) 


= Po sies wy 
re fi(ai1 +t, x2 es) 


for a certain ¢ with |t] < |Az,|. The limit for Ax; — 0 proves the claim. 

The equivalence of ii) and ii) is an immediate consequence of (9.25): if 
“i, ¥ fulfill ii) then each y ~ 41 — 72 satisfies iii), while a + satisfying iii), 
as seen above, is the difference of 71, y2 with common end points. Oo 


The question remains of how to characterise conservative fields. A necessary 
condition is the following. 


Property 9.43 Let f be aC! vector field on Q C R”. If it is conservative, 
we have 


Ofi _ Of; 
GAR Ox; 


Vi Fj. (9.26) 


Proof. Take a potential y for f, so f; = = fori = 1,...,n; in particular y € 
C?(Q). Then formula (9.26) is nothing but Schwarz’s Theorem 5.17. O 


We met this property in Sect. 6.3.1 (Proposition 6.8 for dimension two and Pro- 
position 6.7 for dimension three); in fact, it was proven there that a conservative 
C! vector field is necessarily irrotational, curl f = 0, on 2. 

At this juncture one would like to know if, and under which conditions, being 
curl-free is also sufficient to be conservative. From now on we shall suppose {2 
is open in R? or R®. By default we will think in three dimensions, and highlight 
a few peculiarities of the two-dimensional situation. Referring to the equivalent 
formulation appearing in Theorem 9.42 iii), note that if + is any regular, simple 
arc whose trace I’ is the boundary of a regular and simple compact surface 3’ 
contained in 92, Stokes’ Theorem 9.37 makes sure that f irrotational implies 
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ftr=fcunf-n-o. 
¥ 


Nevertheless, not all regular and simple closed arcs I’ in (2 are boundaries of a 
regular and simple compact surface »’ in 2: the shape of 22 might prevent this 
from happening. If, for example, 2 is the complement in R® of the axis z, it is 
self-evident that any compact surface having a closed boundary encircling the axis 
must also intersect the axis, so it will not be contained entirely in 2 (see Fig. 9.14); 
in this circumstance Stokes’ Theorem does not hold. This fact in itself does not 
obstruct the vanishing of the circulation of a curl-free field around 2; it just says 
Stokes’ Theorem does not apply. In spite of that, if we consider the field 


Yy ‘ x : 


on §2, undoubtedly curl f = 0 on 2, whereas 


fp fit=2m 


with I’ the counter-clockwise unit circle on the xy-plane centred at the origin. This 
is therefore an example (extremely relevant from the physical viewpoint, by the 
way, f being the magnetic field generated by a current along a wire on the z-axis) 
of an irrotational vector field that is not conservative on (2. 


The discussion suggests to narrow the class of open sets (2, in such a way that 
f  curl-free = f conservative 


holds. With this in mind, let us give a definition. 


Figure 9.14. A compact surface »’, with boundary I’, crossed by the z-axis 
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Definition 9.44 An open connected set 2 C R” is simply connected if the 


trace I’ of any arc in 92 can be deformed with continuity to a point, always 
staying within 22. 


More precisely: 2 is simply connected if any closed curve y : J > R” belongs to a 
one-parameter family of closed curves y, : J > R”", 0 < s <1, with the following 
properties: 


i) the map (t,s) > y(t) from J x [0,1] to R” is continuous; 
ii) each trace , = ys(J) is contained in 2; 
iii) y. = y and ‘7 is constant, i.e., [ is a point. 


One says y is homotopic to a point, and the map (t,s) +> s(t) is known as 
a homotopy. 


Simply connectedness can be defined alternatively. For instance, in dimension 
2 we may equivalently demand that 


e the complement of 2 in R? is a connected set, 


or 
e for any Jordan arc y in 2, the interior >); is entirely contained in 92. 


Naively, an open connected set in the plane is simply connected if it has no 
‘holes’; an (open) annulus is thus not simply connected (see Fig. 9.15, left). 

The situation in three dimension is more intricate. The open domain enclosed 
by two concentrical spheres is simply connected, whereas an open torus is not 
(Fig. 9.15, middle and right). 

Likewise, the open set obtained by removing one point from R? is simply con- 
nected, but if we take out a whole line it is not simply connected any longer. It 
can be proved that an open connected set 2 in R® is simply connected if and only 
if for any (piecewise-)regular Jordan arc I’ in 2, there exists a regular compact 
surface »’ in (2 that has I’ as boundary. 


Figure 9.15. A simply connected open set (middle) and non-simply connected ones (left 
and right) 
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Figure 9.16. A closed simple arc (left) bordering a compact surface (right) 


The above characterisation leads to the rather-surprising result for which any 
curve in space that closes up, even if knotted, is the boundary of a regular compact 
surface. This is possible because the surface is not required to be simple; as a matter 
of fact its trace may consist of faces intersecting transversely (as in Fig. 9.16). 

Returning to the general set-up, there exist geometrical conditions that guar- 
antee an open set {2 C R” is simply connected. For instance, convex open sets 
are simply connected, and the same is true for star-shaped sets; the latter admit 
a point Py € 2 such that the segment PoP from Pp to an arbitrary P € {2 is con- 
tained in 92 (see Fig. 9.17). In particular, a convex set is star-shaped with respect 
to any of its points. 


Finally, here is the awaited characterization of conservative fields; the proof is 
given in Appendix A.2.2, p. 527. 


Theorem 9.45 Let 2 CR”, with n = 2 or 3, be open and simply connected. 


A vector field f of class C! on Q is conservative if and only if it is curl-free. 


A similar result ensures the existence of a potential vector for a field with no 
divergence. 


Remark 9.46 The concepts and results presented in this section may be equi- 
valently expressed by the terminology of differential forms. We refer to Ap- 
pendix A.2.3, p. 529, for further details. Oo 


Po 
Pi 
P2 


Figure 9.17. A star-shaped set for Po 


404 9 Integral calculus on curves and surfaces 


9.6.1 Computing potentials explicitly 


Suppose f is conservative on an open (without loss of generality, connected) set 
92 CR”. We wish to find a potential for f, which we already know will be defined 
up to a constant. Let us explain two different methods for doing this. 

The first method uses the representation seen in Theorems 9.42 and 9.45. To 
be precise, fix a point Pp and define the potential at every P € 92 of coordinate x 
by 


where I’|Po, P| C 2 is a simple and (piecewise-)regular arc from Py to P. The 
idea is to choose the path to make the integral as simple as possible to compute 
(recall that the integral is independent of the path, by part ii) of Theorem 9.42). 
In many cases, the best option is a polygonal path with segments parallel to the 
coordinate axes; if so, over each segment the integrand f - 7 depends only on one 
component of f. 


Example 9.47 


Consider the field 
7 y ay 
defined on the open set 2 between the branches of the hyperbola ry = —1/2. It 
is not hard to convince oneself that (2 is star-shaped with respect to the origin, 
and curl f = 0 on {2; hence the vector field is conservative. To find a potential, 
we may use as path the segment from (0,0) to P = (a, y) given by y(t) = (ta, ty), 
O0<t<1. Then 
Qxyt 


(eu) = f Ff (y(t) y(t) dt = > Jit lage 


Substituting u = 1+ 2ryt?, so du = 4zxyt dt, we have 


=4/ la say = Ls 


i 
dt. 


y(z,y) =| OT aa vial, 


The generic potential of f on 9 will be 
y(z,y) = /14+2ry4+c. 


The second method we propose, often easier to use than the previous one, 
consists in integrating with respect to the single variables, using one after the 
other the relationships 


Ov Ov Ov 

eof, ao Hhes es ys BHM: 

Ox 0x2 OX, 
We exemplify the procedure in dimension 2 and give an explicit example for di- 
mension 3. 
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From 5 
5, (Y) = Tie y) 


we obtain 
p(z,y) = Fi(a,y) + vi(y), 


where F\(x,y) is any primitive map of fi(x,y) with respect to 2, i.e., it satisfies 


OF 
=e, y) = fi(x,y), while ~1(y), for the moment unknown, is the constant of the 


Ox 


previous integration in x, hence depends on y only. To pin down this function, we 
differentiate the last displayed equation with respect to y 


dy 
dy 


OF, 


Ge se (a, ) — Feu) = fale.) - le D=20. 


OF 
Note fa(a,y) - 26 
by (9.26), since 


O fe 0 OF, _ Ofe 7 _ 


(x,y) depends only on y, because its x-derivative vanishes 


Calling G(y) an arbitrary primitive of g(y), we have 


wily) =Gly) +e 


whence 
y(z,y) = Fi(z,y) + Gly) +c. 


In higher dimension, successive integrations determine one after the other the 
unknown maps @1(@2,...,%n), W2(%3,---,2n),---,Wn—1(%n) that depend on a de- 
creasing number of variables. 


Example 9.48 
Consider the vector field in R® 
f(x,y, 2) = 2yzit 2z2(a4+ 3y)9 + (y (2x + 3y)+ 2z)k. 


It is straightforward to check curl f = 0, making the field conservative. Integ- 
rating 


O 
5a Y.#) = 2y7 


a) 
produces y(x,y,z) = 2xyz + wi(y,z). Its derivative in y, and 5 (t U.2) = 
Yy 
0 
2Qxz + byz, tells that hy, z) = 6yz, so 
Y 


Wily, z) = 3y7z + Yro(z). 
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a) 
Differentiating the latter with respect to z, and recalling 5 (a Yy,Z) = 2xyt 
z 
3y? + 2z, gives 
dy 
dz 
hence w2(z) = 2? + c. In conclusion, all potentials for f are of the form 


(2S 22, 


(x,y,z) = 2ayz + 8y°z +27 +c. 


We close the section by calling the attention to a class of conservative fields 
for which the potential is particularly easy to find. These are the so-called radial 
vector fields, 


f(x) = g(llell) x, 


where g = g(r) is a real, continuous function on an interval I of [0,-++0o). A field is 
radial when at each point P it is collinear with the vector OP, and its norm depends 
merely on the distance of P from the origin. A straightforward computation shows 
f is conservative on 2 = {x € R” : ||x|| € I} and gives a potential. Precisely, 
define 


p(x) = G(||all) 


where G = G(r) is an arbitrary primitive map of rg(r) on I. In fact, the chain 
rule plus Example 5.3 i), give 


az x 
Vo(x) = G(llall) — = llellg(lell) 7 = fla). 
rey = lella(led gep = Fe) 

For example, f(x) = —_* __ admits p(x) = 1+ |lx||? as potential. 


1+ |la|/? 


9.7 Exercises 


Compute the integral of the map 


x? (1 + 8y) 
J/l+y+4a2y 


along the arc y defined by y(t) = (t,t?, logt), t € [1,2]. 


f(x,y,z) = 


2. Let I be the union of the parabolic arc y = 4— x? going from A = (—2,0) 
to C = (2,0), and the circle x? + y? = 4 from C to A. Integrate the function 
f(x,y) = along the closed curve I’. 
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Integrate f(x,y) = x+y along the closed loop I’, contained in the first quad- 
rant, that is union of the segment between O = (0,0) and A = (1,0), the 
elliptical arc 4x? + y? = 4 from A to B = (2, V2) and the segment joining B 
to the origin. 


4. Compute the integral of f(x,y) = along the simple closed arc 


1 
a+y?4+1 
whose trace is made of the segment from the origin to A = (2,0), the circular 
arc x* + y? = 2 from A to B = (1,1), and the segment from B back to the 


origin. 


Let 71 : (0, 27] —> R? be given by y(t) = (tcost,tsint), and y2, 73 paramet- 
rise the segments from B = (—$, FR) to C = (—7,0) and from C to A = (0,0). 
Compute the length of y ~ 71, +72 +73 whose trace is the union I’ of the 


traces of 71, Y2, Y3, and then find the area of the domain inside I’. 


6. Find the centre of mass of the arc I, parametrised by +(t) = e' cost i—e’ sint J, 
t € [0,7 /2], with unit density. 


Consider the arc yy whose trace I’ is the union of the segment from A = 
(2\/3, -2) to the origin, and the parabolic arc y? = 2x from (0,0) to B = 
(sk, k). Knowing it has unit density, determine k so that the centre of gravity 
of I’ belongs to the x-axis. 


Determine the moment about z of the arc I: parametrised by y(t) = tcosti+ 
tsintj +tk, t € [0, V2] and having unit density. 


Integrate the field f (x,y) = (x?, zy) along y(t) = (t,t), t € [0,1]. 


10. What is the path integral of f(x,y,z) = (z,y,2x) along the arc y(t) = 
(t, 7,49), t € [0,1]? 


11. Integrate f(x,y, z) = (2,/z,,y) along y(t) = (—sint, cost, t?), t € [0, 5]. 


Compute the integral of f(x,y) = (xy?,x?y) along I’, the polygonal path 
joining A = (0,1), B = (1,1), C = (0,2) and D = (1, 2). 


13. Integrate f(x,y) = (0, y) along the following simple closed arc: a segment from 
the origin to A = (1,0), acirclex?+y? = 1 from A to B= (2, v2), a segment 
from B to the origin. 


Determine k so that the work done by the force 


f(x,y) = (a? —ay)it (y? — 27)j 


along the parabola y? = 2kx from the origin to P = (k/2,k) equals 9/5. 
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15. Compute the work integral of 
f(x,y) = (ax?y — sin x)i + (x* + ylogy)j 


along I’, union of the arcs I, Iz, I'3 of respective equations y = x7, y = 1, 
x = 0. Determine the parameter a so that the work is zero. 


Let f(x,y, z) = z(y—2x). Compute the integral of f over the surface o(u, v) 
(u,v, V16 — u2 — v2), defined on R = {(u,v) € R?: u<0,v > 0, u? +? 
16, % +0? > 1}. 


Integrate 


A 


ga 
f(z,y,2) = —“— 
4f/1+ 4 + 4y? 
over 5’, which is the portion of the elliptical paraboloid z = ae — y” above 


the plane z = —1. 


Determine the area of the compact surface >’, intersection of the surface z = 
sy? with the prism given by the planes x+y = 4, y-—x =4, y = 0. 


19. Compute the area of the portion of surface z = \/x?2 + y? below the plane 
[= aly +2). 


Find the moment, about the axis z, of the surface 7) part of the cone z? = 
3(a? + y?) satisfying 0 < z < 3, y > x (assume unit density). 


21. Let R in the zy-plane be the union of 


Ri = {(y,z) €R?:y <0, f1—- tl? <z< 4-9} 


and 
Ro ={(y,2) Rs y 20, y<z< V4—y}. 


Find the coordinate xg of the centre of mass for the part of the surface x = 
/4—y? — 22 projecting onto R (assume unit density). 


Using Green’s Theorem, compute the work done by 


f(x,y) =aryi+ a*yj 


along the closed counter-clockwise loop I’, union of I, I, 3, I of respective 
equations x = y*, y=1, 7 +y? =5, y=0, (x, y > 0). 
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Using Green’s Theorem, compute the work done by f(x,y) = x7y?i+ axj, as 


the parameter a varies, along the closed counter-clockwise curve I’, union of 
I, Iz, I of equations x? + y? =1,2=1, y=27+4+1 (2,y > 0). 


Let I be the union of I, parametrised by y(t) = (e~—*/) cost, sint), t € 


26. 


wo 
= 


[0, x /4], and the segments from the origin to the end points of I, A = (e7/*,0) 
and B = (2/2, 2/2). Find the area of the region Q bounded by I’. 


With the aid of the Divergence Theorem, determine the outgoing flux of 
f(x,y, 2) = (a +yz)it (ez t+y?)j + (ey t+ 2+ )k 
across §2, defined by 


a ay? pe? <1, a? bee 2" <0, god, 22a. 
Find the flux of 
f(x,y, 2) = (wy? + 2)b+ (a? + Dy) + 2(a%2 + 32° + 2)k 
leaving the compact surface 


e+yrte2<2, a +y?—2>0, y>0, 220. 


Using Stokes’ Theorem, compute the integral of 
f(x,y, 2) = zit yj + xyk 


along the boundary of 7, intersection of the cylinder x? + y? = 4 and the 
2 2 
paraboloid z = *7 + ¥-, oriented so that the normal points toward the z-axis. 


Given the vector field 
f(z.y,z) =(yt+2z)t+2(e+2)74+3(a+y)k 


and the sphere x? + y? + z? = 2, find the flux of the curl of f going out of the 
portion of sphere above z = y. 


Verify the field f(x,y, z) = xt — 2yj + 3zk is conservative and find a potential 
for it. 


Determine the map g € C®(R) with g(0) = 0 that makes the vector field 
f(x,y) = (ysine + zy cosz + e¥)i + (g(x) + ze”) j 


conservative. Find a potential for the resulting f. 
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31. Consider 

( g (y) 
x 


+ cosy)é + (2yloga — xsiny)j. 


a) Determine the map g € C®(R) with g(1) = 1, so that f is conservative. 
b) Determine for the field thus obtained the potential y such that y(1, 5) = 0. 


32. Consider the field 
f(x,y) = (2rlogy — ysinx)i + (2 +cosx)j. 


a) Determine g € C®(R) such that g(0) = 1 and making f conservative. 
b) For this field find the potential .p such that p(4,1) = 0. 


33. Consider 


f(x,y, z) = (3y + cos(a + 27))é + (3a + y +. g(z))9 + (y + 22 c08(a + 27))k. 


a) Determine the map g € C®(R) so that g(1) = 0 and f is conservative. 
b) For the above field determine the potential yp with ~(0,0,0) = 0. 


34. Determine the value of \ such that 
f (x, y, z) = (a? + BAy + Byz)it (5a + BAxz — 2)7 + (2+ A)zy — 4z)k 


is conservative. Then find the potential y with y(3,1, —2) = 0. 


9.7.1 Solutions 


t?(1 + 8t?) 
1. For t € [1,2] we have t)) = ————_———. , and 7(t) = (1, 2t, +) , so 
[1, 2] F(t) = Fa and ¥'(0) = (26,7) 
(1 + 8¢?) eee 63 
14+7244t4dt= fa (1+ 87) dt = —. 
he a $ yas 2 


3. Let us compute first the coordinates of B, in the first quadrant, intersection 
of the line y = 2x and the ellipse 4%? + y? = 4, which are B = (2, /2). The 
piecewise-regular arc -y can be divided in three regular arcs 71, 2, 3 of respective 
traces the segment OA, the elliptical arc AB and the segment BO. We can define 
arcs 0,, 62, 63 congruent to 1, 2, Y3 as follows: 


01 (t) = (t, 0) O<¢<a 1, i=, 


62(t) = (cost, 2 sint) OSG s a 02 ~ Yo, 


63(t) = (t, 2t) 0=i5 Os =¥3 


9.7 Exercises 
so that 
led ad aa 

Since 

f(6(é)) =t, f (62(t)) = cost + 2sint, f (53 (t)) = 3¢, 

6, (¢) = (1,0), 6;(t) = (—sint, 2 cost), 6,(4) = (1,2), 

6, =1, ||55(t)|| = Vsin? t + 4cos?¢, 153 (t)|| = v5, 
it follows 


1 a/4 V2/2 
| cart | (cos t+ 2sint) Vein? t+ Loos tat + | 3V5t dt 
0 0 0 


1 3 ne ae 
s+ 5v5+ costV4—3sin* tat +2 | sint\/ 1+ 3 cos? tdt 
0 0 


1 3 
==4-V5+h+h. 
ag vorieb 


[s 


For J,, set u = V3sint, so du = V3cost dt, and obtain 


V6/2 

1 

n==/ V4—u?du. 
V3 Jo 


Substituting v = 5 we have 


V6/2 

1 |1 U V5 2 V6 
I, = — |-uvV4-u?4+2 in — = — + —_ in —. 
1 3 5 uU aresin 5 | 7 BY rae 7 


Similarly for Iz, we set u = V3 cost, so du = —V3sint dt and 


V6/2 

2 

h=-— | V1+u?du. 
V3 V3 


Then 
1 V6/2 
eas ee 2 2 
Ip = = | l+u + log ( l+u +u)] 
V5 1 V10 + V6 
= —— +24 —— | log(2 — log —————_ ] 
5 + + og ( + V3) og 5 


4. Qarctan V2 + veg, 


All 


5. Since &(y) = (1) + €(y2) + £(¥3), we compute the lengths separately. The last 


two are elementary, (72) = 47 and ¢(y3) = 7. As for the first one, 
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Figure 9.18. The arc 7y and the region (2; relative to Exercise 5 


2/30 


2/30 
ies | @lar= fo vi4ePae 


=ny[1-+ =n? + = lo € oy ee 
=- <7? 4+ = 15 —77). 
3" 9 2 83 9 


The area is then the sum of the triangle ABC and the set 2; of Fig. 9.18. We 
know that area(ABC) = a As 2; reads, in polar coordinates, 


= {(r8) 0 <1 <0,0<0< sr}, 


(2/3)m pO 4 
area(Q1) = | dz dy = | | rdrd@é = ae 
Q 0 0 


eT 2 eT $1 
"es Benet 8" Ba en/e) 


7. The parameter k is fixed by imposing yg = 0. At the same time, 
YG >= jh yr | y 
V1 2 


where y(t) = (—V3t,t), t € [-2, 0] and yo(t) = (442, 1), t € [0, k]. Then 


0) 
1 
ve= | ovat fo i/1+tdt= 3 (1 (1 + k?)3/2 — 13); 
0 


—2 


we have 


6. We have 


the latter is zero when k = V/13?/8 — 1, 
8. As y'(t) = (cost —tsint)é + (sint +tcost)j +k and ||-+‘(t)||? = 2+ t?, we have 


La [@ + y?) =f #7/2+¢2dt = = V3 — 5 loa(t + V2). 
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9. From f (¥(t)) = (t,t?) and y(t) = (2, 1) follows 


1 1 
7 
4 43 Be 8 
[fa [8 @ena= fe +t) di= 7. 


9 T 
10. -; ll. -. 

A? 4 
12. The piecewise-regular arc y restricts to three regular arcs 1, Y2, 73, whose 
traces are the segments AB, BC, CD. We can define arcs 6;, 62 and 63 congruent 


to V1, Y2, Y3: 


61 (t) = (¢, 1) Os rel, aN, 
62(t) = (t,2—t) O<¢<1; 62 ~ — 2, 
63(t) = (t, 2) Pe et 63 ~ 3; 
Then from 
Ff (i (t)) = (t,t7), Ff (6o(t)) = (42-2)? 07(2-1)), Ff (63(t)) = (46, 207) 
6;(t) = (1,0), 6,(t) = (1,—-1), 63(t) = (1,0), 
we have 


[s-ar=[ pap-f sap] far 


= [we - (1,0) dt — [ GQ=1) O=—))(.—1)dt 
0 0 
+ (4t, 2t7) - (1,0) dt = 2. 
0 


13. 0. 


14. Let us impose 


where +/(t) = (szt?,t), t € [0,k] (note k 4 0, otherwise the integral is zero). Since 
¥ (t) = (gt), 


ky 45 4 4 
t a a: Ose 
r= — - => ?-—.)dt= nh. 
es r [ (a ane ~ aR 40 
Therefore ak = 2, so k = 2. 
15. The work integral equals L = 2 — =a, which vanishes if a = 3. 


16. The surface is the graph of y(u, v) = V16 — u? — v?, so (6.49) gives 
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-4 -2 fo 


Figure 9.19. The regions D, and Dz2 relative to Exercise 16 


Oy, 2 Oy, 2 4 


vu, oll = 1+ (S) + (SF) = Gala 


Therefore 


[t= [ ve u? — v2(v is —— = aig 
o R 


16 — u* — v4 


=4 [ (v—2u)duav. 
R 


The region R is in the second quadrant and lies between the circle with radius 
4, centre the origin, and the ellipse with semi-axes a = 2, b = 1 (Fig. 9.19). 
Integrating in v first and dividing the domain into D; and Dg, we find 


V16—u2 J16—u2 
728 
[=a I. | (vu — 2u) Javan fo i (v—2u)dudu) ==. 
_ oa 


17. A parametrisation for ’ is (Fig. 9.20) 


—v’), (u,v) ER = {(u,v) € R242 =1}. 


a(u,v) = (u,v, -— 


Then ||v(u,v)||2 = 1+ 4 +40? and 


[f= for nauae, 


Passing to elliptical polar coordinates, 


27 1 
| faa | (rsin@ + 1)rdrd6 = 2r. 
x 0 Jo 
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Figure 9.20. The region R (left) and the surface » (right) relative to Exercise 17 


18. Parametrise »’ by 
1, 
a(u,v) = (u, v, 3” ) ; (u,v) ER, 


where F is the triangle in the uv-plane of vertices (—4,0), (4,0), (0,4) (Fig. 9.21). 
In this way ||v(u, v)|| = V1 + v2, so 


area(Z) = f r= f VIF Pana = | ( : 


14 2 
= SZ VIT + dlog(4 + VI7) + 5. 


4—v 


V1l+v? du) dv 
4 


19. area(2’) = 82. 


20. Using cylindrical coordinates >’ reads 
o(r,0) =(r,0, V8r), — (r0) ER={(r,0):0 <r < V3, 


(see Fig. 9.22). Then ||v(r,@ )|| = 2 and 


z 


Figure 9.21. The region R (left) and the surface »’ (right) relative to Exercise 18 
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yYrer 


Figure 9.22. The region R (left) and the surface ©’ (right) relative to Exercise 20 


(5/4)n pV/3 9 
[@+v)=2 | Prarae—2 | | r? dr dé = <n. 
b5; R n/4 0 2 


21. ag = 2%. 


22. The arc is shown in Fig. 9.23. The work equals L = $n f -7, but also 


b= | (B-F)acay 


by Green’s Theorem, with Q inside I and f;(x,y) = ry, fo(x,y) = x*y. Then if 
we integrate in « first, 


1 /5—-y? AT 
b= | / (4x°y — 2) dady = —. 
0 y 6 


2 


Figure 9.23. The arc I’ and the region 2 relative to Exercise 22 
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y=1+2? 


Y3 


Figure 9.24. The arc I" and the region 92 relative to Exercise 23 


23. The arc I’ is shown in Fig. 9.24. Green’s Theorem implies the work is 


b= $ f-r= | (a—20%y)dedy, 


where J? is inside J’. Integrating vertically, 


L= | / (a 20y) dyde = | Jay — 2?y?] dx =a(=-7)-=. 
0 JVi—-a 0 y=V1-a? 3. ¢=4 35 


24. We use (9.23) on the arc I’ (see Fig. 9.25). The integral along OA and OB is 
zero; since y(t) = (—e~ 7/4 (cost + sin t), cost), it follows 


m/A 
| (sint(cost + sin t) + cos” t)e~—-*/9 dt = —— + Se" 
0 


0 A x 


Figure 9.25. The arc I’ and the region 2 relative to Exercise 24 
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25. The Divergence Theorem tells 


| fon=f aif =f (0? +y? + 2)dedyde. 
an Q Q 


In spherical coordinates 2 becomes $2’ defined by 0 < r < 1,0 < y < 7/4 and 
0<60<7. Then 


am por/4 pl 
| fon= | } | aA snedrdode =" nO=4>), 
aQ 0 Jo 0 10 


26. The flux is 8a(/2 — 1). 


27. Let us parametrise »’ by 


2 


o(u,v) = (u,v, +), with (u,v) ER = {(u,v) € R?:u? +? <4}. 


i) 


Then v(u,v) = (—éu, —$,1) is oriented as required. In order to use Stokes’ The- 


orem, notice curl f = 1i — y7 + Ok, and 
Py 
| fae) curl fon = [ (ui — vj + Ok) - (- <i — —j +k) dudv 
as by R 9 2 
ix fe43 *) dud 
= —-u’ + -v~)dudv. 
Rr 9 Zz 
In polar coordinates, 
21 2 
2 1 10 
faa) | (— = cos” 6+ =sin* 0)r? dr dé = —r. 
ay 0 Jo 9 2 9 
28. We use Stokes’ Theorem with 
y= 1 (G2) ER? :a7?4+y? +27 =2, z>y}, 


whose boundary is 0’ = {(z,y,z) € R® : a? + y?4+ 27 = 2,z = y}. So we 
parametrise 02) by 


¥(t) = (V2 cost, sint, sin t) , t € (0, 27] ; 
then 
f(y(t)) =2sinti+ 2(V2cost + sint) j + 3(V2cost + sint) k, 
+'(t) = —V2sinti+costj + costk 
and 


fewpn= ff p-r=svin, 
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29. The field f is defined on the whole R? (clearly simply connected), and its curl 
is zero. By Theorem 9.45 then, f is conservative. 
To find a potential y, start from 


Op Op 
Op _ OP OP a Bo 9.27 
Qn” , gy 20 


From the first we get 
1 
g(a, Y; z) — xt a wry, z) ; 
Differentiating in y and using the second equation in (9.27) gives 


Ov Ov, 
— = — — —2 


hence 
dily,z)=—y?+%2(z) and = g(a, y,z) = =a? — y? + Y2(z). 


Now we differentiate y in z and use the last in (9.27): 


Oy _ dye _ 3 2 
By 2) = ae (2)=32, so baz) = 52 +c. 


All in all, a potential for f is 


1 3 
P(t yz) = 50 — YP + 52° +e, cER. 


30. The field is defined over the simply connected plane R?. It is thus enough to 
impose 
Of, 


By ey) — Gq (h y), 


where fi(z,y) = ysinz + zycosz +e¥ and fo(z,y) = g(x) + ze’, for the field to 
be conservative. This translates to 


sing +xcosxz +e = g'(x) +e", 
ie., g(x) = sinz + xcosz. Integrating, 
g(a) = f (sine +2008) de =asinz +e, ceER. 
The condition g(0) = 0 forces c = 0, so the required map is g(x) = xsinz. 


To find a potential y for f, we must necessarily have 


0 O 
& (x,y,z) = ysine + eycosa + e¥ and ? (x,y,z) =asine + ae". 


Oy Oy 
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The second equation gives 
y(2,y) = cysina + ze” + a(x); 


differentiating in x and using the first equation gives 


a) ; 
5 (52) = ysinz + rycosx +e¥ + w(x) =ysinz + rycosx +e". 
a 


Therefore wy’ (x) = 0, so Y(a«) = c for an arbitrary constant c in R. In conclusion, 


yp(z,y) = zysinzg+2xze¥ +c, ceER. 
31. a) g(y) = "5 b) pla, y) = y? log x + x cos y. 
22.8) o(2) =a? 4 1; b) p(x, y) = (a? + 1) logy + ycosa — 4. 
33. a) g(z) =z -1; b) v(a, y, z) = 8ay+ sy” +yz—yt+sin(x + 27). 


34.A=1e o@y;2)= ge + Bayz + Say — 227 — Qy+ 4. 
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Ordinary differential equations 


Differential equations are among the mathematical instruments most widely used 
in applications to other fields. The book’s final chapter presents in a self-contained 
way the main notions, theorems and techniques of so-called ordinary differential 
equations. After explaining the basic principles underlying the theory we review 
the essential methods for solving several types of equations. We tackle existence 
and uniqueness issues for a solution to the initial value problem of a vectorial 
equation, then describe the structure of solutions of linear systems, for which 
algebraic methods play a central role, and equations of order higher than one. In 
the end the reader will find a concise introduction to asymptotic stability, with 
applications to pendulum motion, with and without damping. 

Due to the matter’s richness and complexity, this exposition is far from ex- 
haustive and will concentrate on the qualitative aspects especially, keeping rigorous 


arguments to a minimum!. 


10.1 Introductory examples 


The reader will presumably be already familiar with differential equation that 
formalise certain physical principles, like Newton’s law of motion. Differential 
equations are natural tools, hence widely employed to build mathematical mod- 
els describing phenomena and processes of the real world. The reason is that 
two quantities (the variables) often interact by affecting one another’s variation, 
which is expressed by a relationship between one variable and some derivative of 
the other. Newton’s law, for instance, states that a force influences a particle’s ac- 
celeration, i.e., the change of its velocity in time; and velocity itself is the variation 
of displacement. If the force is a function of the particle’s position, as happens in 
the case of a spring, we obtain an equation of motion in terms of time t 


' For proofs, the interested reader may consult, among others, the rich monograph by 
G. Teschl, Ordinary Differential Equations and Dynamical Systems, American Math- 
ematical Society, 2012. 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_10, 
© Springer International Publishing Switzerland 2015 
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what this says is that the spring’s pulling force is proportional to the displacement, 
and acts against motion (k > 0). A more complicated example is 


where the spring’s rigidity is enhanced by an increase of the displacement; both 
these are differential equations of second order in the variable t. Another ex- 
ample originating from classical Mechanics, that will accompany us throughout 
the chapter, is that of a pendulum swinging freely on a vertical plane under the 
effect of gravity and possibly some friction. The relationship between the angle 6 
swept by the rod and the bob’s angular velocity and acceleration reads 


d?0 


C7 tac +ksind =0. 


Multiple mass-spring systems, or compound pendulums, are governed by systems 
of differential equations correlating the positions of the various masses involved, 
or the angles of the rods forming the pendulum. 

The differential equations of concern, like all those treated in the chapter, 
are ordinary, which means the unknowns depend upon one independent variable, 
conventionally called ¢ (the prevailing applications are dynamical, of evolution 
type, so t is time). Partial differential equations, instead, entail unknown functions 
of several variables (like time and space coordinates), and consequently also partial 
derivatives. 

Ordinary differential equations and systems are fundamental to explain how 
electric and electronic circuits work. The simplest situation is that of an DRC 
circuit, where the current 7 = i(t) satisfies a linear equation of second order of the 


type 


Electric networks can be modelled by systems of as many of the above equations 
as the number of loops, to which one adds suitable conservation laws at the nodes 
(Kirchhoff’s laws). For electronic circuits with active components, differential mod- 
els are typically non-linear. 

Materials Sciences, Chemistry, Biology, and more generally Life and Social Sci- 
ences, are all fertile sources of mathematical models based on ordinary differential 


equations. For instance, 
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describes the radioactive decay of a substance y (the rate of disappearance is pro- 
portional, for k < 0, to the quantity of matter itself). But the same equation 
regulates the dynamics of populations (called in this case Malthus’ law, it predicts 
that the rate of growth y’/y of a population in time equals the difference k = n—m 
between the constant birth rate n and death rate m). Malthus’ model becomes real- 
istic by further assuming k is corrected by a braking term that prevents unlimited 
growth, via the so-called logistic growth equation 


y' = (k— By)y. 
Another example are the Volterra-Lotka equations 


(ky — Bip2)p1 


= (—ke + Bop1)po, 


that preside over the interactions of two species, labelled preys p; and predators po, 
whose relative change rates p//p; are functions not only of the species’ respective 
features (ki, k2 > 0), but also of the number of antagonist individuals present on 
the territory (1, G2 > 0). 

At last, we mention a source — intrinsic, so to say, to differential modelling — of 
very large systems of differential equations: the discretisation with respect to the 
spatial variables of partial differential equations describing evolution phenomena. 
A straightforward example is the heat equation 


Ou O07u 


a agp = 0<a<L,t>0, (10.1) 


controlling the termperature u = u(t) of a metal bar of length L in time. Dividing 
the bar in n+ 1 parts of width Az by points x; = jAz,0 < j < n+1 (with 
(n + 1)Ax = L), we can associate to each node the variable u; = u,(t) telling 
how the temperature at x; changes in time. Using Taylor expansions, the second 
spatial derivative can be approximated by a difference quotient 


ao), ee uj—1(t) — 2uj(t) + uj+i(t) 
0x2?” Ax? , 


so equation (10.1) is well approximated by a system of n linear ordinary differential 
equations, one for each internal node: 


The first and last equation of the system contain the temperatures ug and uy+1 at 
the bar’s ends, which can be fixed by suitable boundary conditions (e.g., uo(t) = ¢, 
Un+i(t) = w if the temperatures at the ends are kept constant by thermostats). 
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10.2 General definitions 


We introduce first of all ‘scalar’ differential equations (those with one unknown), 
for the benefit of readers who have not seen them in other courses. 

An ordinary differential equation (ODE is the standard acronym) is a 
relationship between a real independent variable (here, t), an unknown function 
y = y(t), and the derivatives y‘”) up to order n 


F(t, yy’, ,y™) = 0, (10.2) 


where ¥ is areal map of n+ 2 real variables. The differential equation has order 
n, if n is the highest order of the derivatives of y appearing in (10.2). A solution 
(in the classical sense) of the differential equation on the real interval J is a map 
y: J —R, differentiable n times on J, such that 


Fit, y(t), y'(),..,y™(t)) =0 for allt e J. 


Using (10.2) it is often useful to write the highest derivative y in terms of 
t and the other derivatives (in several applications this is precisely the way a dif- 
ferential equation is typically written). In general, the Implicit Function Theorem 
(Sect. 7.1) makes sure that equation (10.2) can be solved for y‘”) when the partial 
derivative of F in the last variable is non-zero. If so, (10.2) reads 


et (eee ie (10.3) 


with f areal map of n+1 real variables. Then one says the differential equation is 
in normal form. The definition of solution for normal ODEs changes accordingly. 


As the warm-up examples have shown, beside single equations also systems of 
differential equations are certainly worth studying. The simplest instance is that 
of a system of order one in n unknowns, written in normal form as 

yt = filt, y1, Yo, es ty) 
Ya = fo(t, yi, ya, --+,Yn) 


(10.4) 
ie = Fults Y1, Y2, bits Ue Vs 
where f; is a function of n + 1 variables. The vectorial notation 
y = f(t,y) (10.5) 


where y = (Yi)i<i<n and f = (fi)i<i<n is quite convenient. A solution is now 
understood as a vector-valued map y: J > R”, differentiable on J and satisfying 


y(t) = f(t, y(t)) for allte J. 
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An especially relevant case is that of linear systems 


yi = a1i(t)y1 + ar2(t)yo +...+ ain(t)yn + b1(t) 
Yo = agi (t)y1 + a22(t)y2 +...t+ don (t)Yn + b2(t) 


y,, = Anilt)y1 + Qne(t)yo +... + @nn(t)yn + dn (t), 
or, in compact form, 


y’ = Alt)y + b(t) (10.6) 


where A(t) = (aj; (a) eee 
matrices, and b(t) = (b;()),-,<,, is a map from J to R”. First-order linear equa- 
tions play a particularly important role, both theoretically and in view of the 
applications: on one hand, they describe many problems that are linear in nature, 
on the other hand they approximate more complicated, non-linear equations by a 
linearisation process (see Remark 10.25). 

In writing a differential equation like (10.5) or (10.6), it is customary not to 
write the t-dependency of the solution y explicitly. 

Systems of first order are the main focus of our study, because they capture 
many types of differential equations provided one adds the necessary number of 
unknowns. Each differential equation of order n can be indeed written as a system 
of order one in n unknonws. To be precise, given equation (10.3), set 


maps J to the vector space R”’” or squaren xX n 


yay, Lt 


L€., 


(10.7) 
The differential equation then becomes 
Une = PC Uijoes Un) 
so we obtain the first-order system 
Yi = Ye 
(10.8) 


Yn—1 = Yn 
Un = Me Ura owes: 
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It is clear that any solution of equation (10.3) generates a solution y of the system, 
by (10.7); conversely, if y solves the system, it is easy to convince ourselves the 
first component y; is a solution of the equation. System (10.8) is then equivalent 
to equation (10.3). 


The generalisation of (10.4) is a system of n differential equations in n un- 
knonws, where each equation has order greater or equal than 1. Every equation of 
order at least two transforms into a system of first order. Altogether, then, we can 
always reduce to a system of order one in m > n equations and unknonws. 


Let us go back to equation (10.5), and suppose it is defined on the open set 
Q=Ix DCR"*!, where J C R is open and D C R” open, connected; assume 
further f is continuous on 2. 


Definition 10.1 A solution of the differential equation (10.5) is a C+ func- 
tion y = y(t) : J > D, with J a non-empty open interval in I, such that 


y'(t) = f(t, y(t)), Vt € J. 


We remark that a solution may not be defined on the entire I. 

A solution y(t) is therefore a differentiable curve defined on J with trace con- 
tained in D. The vector f Ges y(tx)), when not 0, is tangent to the curve at each 
point ¢, € J. 

The graph of y(t) 


Gy) = {(t,y()) € 1x DCR": te J} 


is called an integral curve of the differential equation (see Fig. 10.1). 


Y1 


to ty tg t 


Figure 10.1. Integral curves of a differential equation and corresponding tangent vectors 


10.2 General definitions 427 


The differential equation has, in general, infinitely many solutions. Typically 
these depend, apart from t, upon n arbitrary constants c1,...,Cn. We indicate 
with y(t;c,,...,€n) the set of solutions, which is called general integral. 


A natural way to select a special solution is to impose that this solution takes 
at the time to € J a given value yo € D. We thus consider the problem of finding 
y = y(t) such that 


(10.9) 


aoe in J, 


y(to) = Yo, 


where J is an open sub-interval of J containing to. Since J in general depends on the 
solution, it cannot be determined a priori. This kind of question is called Cauchy 
problem for the differential equation, or initial value problem because it models 
the temporal evolution of a physical system, which at to, the initial moment of the 
mathematical simulation, is in the configuration yo. Geometrically speaking, the 
condition at time to is equivalent to asking that the integral curve pass through the 
point (to, yo) € 2. A Cauchy problem may be solvable locally (on a neighbourhood 
J of to), or globally (J = J, in which case one speaks of a global solution); these 
two situations will be treated in Sects. 10.4.1 and 10.4.3, respectively. 


A particularly important class of ODEs is that of autonomous equations 


for which f does not depend explicitly on t. The linear system 
y = Ay 


with given A € R””, is one such example, and will be studied in Sect. 10.6. For 
autonomous ODEs, the structure of solutions can be analysed by looking at their 
traces in the open set D C R”. The space R” then represents the phase space of 
the equation. 

The projection of an integral curve on the phase space (Fig. 10.2) 


I'(y) = {y@) € D:te J} 


is called an orbit or trajectory of the solution y. A solution y of the Cauchy 
problem (10.9) defines an orbit passing through yo, which consists of a past tra- 
jectory (t < to) and a future trajectory (t > to). The orbit is closed if it is closed 
viewed as trace of y. Conventionally an orbit in phase space is pictured by the 
streamline of the flow, and the orientation on it describes the generating solution’s 
evolution. 


Application: the simple pendulum (I). The simple (gravity) pendulum is the 
idealised model of a mass m (the bob) fixed to the end of a massless rod of length L 
suspended from a pivot (the point O). When given an initial push, the pendulum 


428 10 Ordinary differential equations 


t=to 


Figure 10.2. An integral curve and the orbit in phase space yiy2 of an autonomous 
equation 


swings back and forth on a vertical plane, and the bob describes a circle with 
centre O and radius L (Fig. 10.3). In absence of external forces, e.g., air drag or 
friction of sorts, the bob is subject to the force of gravity g, a vector lying on the 
fixed plane of motion and pointing downwards. To describe the motion of m, we 
use polar coordinates (r,0) for the point P on the plane where m belongs: the 
pole is O and the unit vector 7 = (cos0, sin 0) is vertical and heads downwards; let 
t,. = t,(@), te = te(@) be the orthonormal frame, as of (6.31). 
The position of P in time is given by the vector OP(t) = Lt,(@(t)), itself 
determined by Newton’s law : 
d-OP 
ma = 9- (10.10) 


Figure 10.3. The simple pendulum 


10.2 General definitions 429 
Successively differentiating OP(t), with the aid of (6.39) we obtain 
dOP dt, dé dé 
= = 


dt dddt dt °’ 

d2OP d26 dé dt dé d26 do\? 
eden goad ee eg ah) ae, 
dt? eo ae ae de a (S) 


On the other hand, if g is the modulus of gravity’s pull, then g = mgt = 
mg cos @t, — mgsin@ tg, so Newton’s law becomes 


2 2 
mL (- € t, + a) = mg (cos 6t,. = sin 6 ta) . 


Taking only the angular component, with k = g/L > 0, produces the equation of 
motion 
d?6 
—~ +ksin@=0; 10.11 
dt? ( ) 
notice that the mass does not appear anywhere in the formula. 
A mathematically more realistic model takes into account the deceleration 
generated by a damping force at the point O: this will be proportional but opposite 
to the velocity, and given by the vector 


dé 
a= —amL — te, with a> 0, 
dt 
adding to g in Newton’s equation. As previously, the equation of motion (10.11) 
becomes 
d0 dé 


pe ee a (10.12) 
An ideal (free, undamped) motion is subsumed by taking a = 0, so we shall 
suppose from now ona >Oandk> 0. 

To find the position of P at time t, we first need to know its position and 
velocity at the starting time, say tp = 0. Let us then declare 69 = 0(0) and 
i= 54 (0), which, in essence, is the statement of an initial value problem for an 


equation of order two: 


d0 dé 
Gp tog + ksing =0, i> U0; 


do 
(0) = 8, = 


(10.13) 


(Se. 


To capture the model’s properties, let us transform this to first-order form 
(10.34) by setting 
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dé 
y = (y1, y2) = (8, ap) Yo = (90, 61), f(t,y) = Fly) = (ye,—-ksin yi — aye). 


The system is autonomous, with J = R and D = R?. 
(The example continues on p. 448.) Oo 


10.3 Equations of first order 


Before we undertake the systematic study of differential equations of order n, 
we examine a few types of first-order ODEs that stand out, in that they can be 
reduced to the computation of primitive maps. 


10.3.1 Equations with separable variables 


One speaks of separable variables (or of a separable ODE) for equations of the 
following sort 


y’ = g(t)h(y), (10.14) 


where g is continuous in t and h continuous in y. This means the function f(t, y) 
is a product of a map depending only on t and a map depending only on y, so that 
the variables are ‘separate’. 

If y € Ris a zero of h, h(y) = 0, the constant map y(t) = y is a particular 
integral of (10.14), for the equation becomes 0 = 0. Therefore a separable ODE 
has, first of all, as many integrals y(t) = constant as the number of the distinct 
roots of h. These are called singular integrals of the equation. 

On every interval J where h(y) does not vanish we can write (10.14) as 


1 
Let H(y) be a primitive of On (with respect to y). The formula for differentiating 
y 


composite functions gives 


d dH dy 1 dy 
HOw) = FS = oH = 009, 
whence H(y(t)) is a primitive of g(t). Thus, given any primitive G(t) of g(t) we 
will have 


H(y(t))=Git)+e, ceR. (10.15) 


1 dH : : 
But since —— = —— never vanishes on J by assumption — and thus does not 


h(y) dy 
change sign, being continuous — the map H(y) will be strictly monotone on J, 


10.3 Equations of first order 431 


hence invertible (Vol. I, Thm. 2.8). This implies we may solve (10.15) for y(t), to 
get 


y(t) = H-*(G(t) +0), (10.16) 


where H~! is the inverse of H. The above is the general integral of (10.14) on 
every interval where h(y(t)) is not zero. But, should we not be able to attain the 
analytic expression of H~!(x), formula (10.16) would have a paltry theoretical 
meaning. In such an event one is entitled to stop at the implicit form (10.15). 

If equation (10.14) admits singular integrals, these might be of the form (10.16) 
for suitable constants c. Certain singular integrals can be inferred formally from 
(10.16) letting c tend to too. 

One recovers expression (10.15) in a formal and easy-to-remember manner 


by interpreting the derivative = as a ‘quotient’, in Leibniz’s notation. In fact, 


dividing (10.14) by h(y) and ‘multiplying’ by dt gives 


Rae 
h(y) 


= g(t) dt, 


which can be then integrated 


This corresponds exactly to (10.15). At any rate the reader must not forget that 
the correct proof of the formula is the aforementioned one! 


Examples 10.2 


i) Let us solve y’ = y(1—y). If we set g(t) = 1 and h(y) = y(1 — y), the zeroes 
of h determine two singular integrals y;(t) = 0 and yo(t) = 1. Next, assuming 
h(y) different from 0, we rewrite the equation as 


fawn * 


then integrate with respect to y on the left and t on the right to obtain 


log |———| =t+ec 
Ly 
Exponentiating yields 
7 | aette= ke’, 
Ly 
where k = e® is an arbitrary constant > 0. Therefore 
a +ke’ = Ke’, 


i) 
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with K being any non-zero constant. Solving now for y in terms of t gives 
(t) = Ket 
I) TO Kee 
In this case the singular integral y;(t) = 0 belongs to the above family of solutions 


for K = 0, a value originally excluded. The other integral y2(t) = 1 arises 
formally by taking the limit, for K going to infinity, in the general expression. 


ii) Consider the ODE 
y = VV. 


It has a singular integral y(t) = 0. Separating variables gives 


sO 
i 2 
2./y =tte, ie., ut) =(5 +6) ; cER, 


where we have written c in place of c/2. 
iii) The differential equation 
peed 


y ev +1 


> 0 for any y, so there are no singular integrals. 


1 
h t)=e'+1, h(y) = 
as g(t) = et +1, h(y) = = 
The separating recipe gives 


[tere nay = fie +aae, 


ety=e+t+e, cER. 
But now we are forced to stop for it is not possible to explicitly write y as 
function of the variable t. 


sO 


10.3.2 Homogeneous equations 


Homogeneous are ODEs of the type 


y' =9 (<) (10.17) 
with y = y(z) continuous in z. The map f(t, y) depends on t and y in terms of 
their ratio ; only; equivalently, f(At,Ay ) = f(t, y) for any A £0. 


A homogeneous equation can be transformed into one with separable variables 


y(t) 


by the obvious substitution z = 2 understood as z(t) = —* Then y(t) = tz(t) 
and y(t) = z(t) + tz’(t), so (10.17) becomes 
/ y(z) me 


Z =, 


U 
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which has separable variables z and t, as required. The technique of Section 10.3.1 
solves it: each solution Z of y(z) = z gives rise to a singular integral z(t) = 2, 
hence y(t) = Zt. Assuming y(z) different from z, instead, we have 


giving 


H(z) = log |t| +c 


where H(z) is a primitive of . Indicating by H~+ the inverse function, we 
Zz 


i 
y(z) — 
z(t) = H*(log|t| +e), 


have 


hence, returning to y, the general integral of (10.17) reads 
y(t) =t H* (log |t| +c). 


Example 10.3 
We solve 
Py =y +tyt+e. (10.18) 
In normal form, this is 


— u) 2 a7 
0 a “ar 


a homogeneous equation with y(z) = z2-+z+1. The substitution y = tz generates 
a separable equation 


There are no singular integrals, for z? + 1 is always positive. Integration gives 
arctan z = log |t| +c 
so the general integral of (10.18) is 
y(t) = t tan(log |¢| + c). 
The constant c can be chosen independently in (—co,0) and (0,+00), because 


of the singularity at t = 0. Notice also that the domain of each solution depends 
on the value of c. 


10.3.3 Linear equations 


The differential equation 


y’ = a(t)y + o(t) (10.19) 
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with a, b continuous on J, is called linear.? The map f(t,y) = a(t)y + b(t) is 
a polynomial of degree one in y with coefficients dependent on t. The equation 
is called homogeneous if the source term vanishes identically, b(t) = 0, non- 
homogeneous otherwise. 

Let us start by solving the homogeneous equation 


y' = a(t)y. (10.20) 


It is a special case of equation with separable variables where g(t) = a(t) and 
h(y) = y, see (10.14). Therefore the constant map y(t) = 0 is a solution. Apart 
from that, we can separate variables 


[ow - fowar 


If A(t) indicates a primitive of a(t), i-e., if 


Q 
c 
Q 
— 
| 
a 
= 
+ 
a2) 
io) 
MM 
A 


(10.21) 


then 
log |y| = A(t) +, 


or equivalently 
ly(t)| =e°e*®, 


so 
y(t) = +KeA®, 


where kK = e° > 0. The particular solution y(t) = 0 is subsumed by the gen- 
eral formula if we let K be 0. Therefore the solutions of the homogeneous linear 
equation (10.20) are given by 


y(t) = Ke4, KER, 


with A(t) as in (10.21). 

Now we tackle the non-homogeneous case. We use the so-called method of 
variation of constants, or parameters, which consists in searching for solutions of 
the form 

y(t) = K(t)e4), 
where now k(t) is a function to be determined. The representation of y(t) is always 
possible, for e4) > 0. Substituting into (10.19), we obtain 


K'(t)e4 + K(t)e4Ma(t) = a(t) K (t)e4™ + o(2), 
so  K'(t)=e 4o(2). 


? Contrary to the description of Vol. I, here we prefer to write the linear term y on the 
right, to be consistent with the theory of linear systems. 
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Calling B(t) a primitive of e~4Mb(t), 
fl eAMs(t)dt = Bi) +e, ceER, (10.22) 


we have 


so the general solution to (10.19) is 
y(t) = e4 (Bit) +0), (10.23) 


with A(t) and B(t) defined by (10.21) and (10.22). The integral is more often than 
not found in the form 


y(t) = el a(t) dt i e7 J a(t) dt b(t) dt, (10.24) 


where the steps leading to the solution are clearly spelt out, namely: one has to 
integrate twice in succession. 
If we want to solve the Cauchy problem 


‘= a(t)y + W(t the interval J, 
{? a(t)y + b(t) on the interva (10.25) 


y(to) = Yo, with to € J and yo ER, 
it might be convenient to choose as primitive of a(t) the one vanishing at to, which 
we write A(t) = / a(s) ds, according to the Fundamental Theorem of Integral 
Calculus; the ean be done for B(t) by putting 
t 
B(t) = 4 on Sig 4) 84 B05) dg 
0 


(recall that the variables in the definite integral are arbitrary symbols). Using these 
expressions for A(t) and B(t) in (10.23) we obtain y(to) = c, hence the solution of 
the Cauchy problem (10.25) will satisfy c = yo, i.e., 


(10.26) 


Examples 10.4 
i) Find the general integral of the linear equation 
y' =ay +6, 
where a # 0 and 0 are real numbers. 
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Choosing A(t) = at and B(t) = —2e~“ generates the integral 
at b 
y(t) = ce™ — -. 
a 
If a = 1 and b = 0, the formula shows that all solutions of y’ = y come in the 
form y(t) = ce’. 
If we want to solve the Cauchy problem 
1° =ay+b on [1,+0co), 
y(1) = yo, 
b 
it is better to choose A(t) = a(t — 1) and B(t) = . (1 - eat), so that 


b b 
v= (vr 2) oer 2 


a 
In case a < 0, the solution converges to —b/a (independent of the initial datum 
Yo) as t + +00. 
ii) We find the integral curves of the ODE 

ty t+y=0? 
that lie in the first quadrant of the plane (t, y). Written as (10.19), the equation 


1S 


pl +t 
y= t! ) 


so a(t) = —1/t, b(t) = t. Choose A(t) = — logt, and then e4) = 1/t, e"4 =¢,; 
consequently, 


ferro dt = ye dt = st +c. 


Therefore for t > 0, the general integral reads 


aie eae 
=ofe = = 42, 
y(t) AG +e) 3 + 


If c > 0, then y(t) > 0 for any t > 0, while if c < 0, y(t) > 0 when t > ¥/3j|cl. 


Remark 10.5 In the sequel it will turn out useful to consider linear equations of 
the form 


2 2, (10.27) 
where A is a complex number. We should determine the solution over the complex 


field: this will be a differentiable map z = Rez +iZImz:R-— C (i.e., its real and 
imaginary parts are differentiable). Recall that the equation 


ae =e teR (10.28) 
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(proved in Vol. I, Eq. (11.30)) is still valid for \ € C. Then we can say that any 
solution of (10.27) can be written as 


z(t) = ce (10.29) 


with c € C arbitrary. Oo 


10.3.4 Bernoulli equations 


This family of equations is characterised by the following form 


y’ = p(t)y* + q(t)y,a #0,a#£1, (10.30) 


with p, q continuous on I. If a > 0, the constant map 0 is a solution. Supposing 
then y ~ 0 and dividing by y® gives 


y cy! =p(t)+q(t)y*. 


As (y!~%)' = (1—a)y~°y’, the substitution z = y!~° transforms the equation 
into a linear equation in z 


z' = (1—a)p(t) + (1 — a)a(é)z, 
solvable by earlier methods. 


Example 10.6 
Take y’ = t?y? + 2ty, a Bernoulli equation where p(t) = t?, q(t) = 2t and a = 2. 


The transformation suggested above reduces the equation to 2’ = —(2tz + t°), 
solved by z(t) = ce’ — (2 + t?). Hence 
1 
t) = ——_____ 
y( ) cet? _ (2 + t?) 
solves the original equation. O 


10.3.5 Riccati equations 


Equations of Riccati type crop up in optimal control problems, and have the typical 
form 


y’ = p(t)y? + a(t)y + r(t), (10.31) 


where p, q, r are continuous on J. The general integral can be found provided we 
know a particular integral y = u(t). In fact, putting 


1 x! 
= t = h -— ‘(t) — ; 
y=u)+4, hence y=) - 3 
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we obtain 


u'(t) — - = p(t) (u?(t) + yn) + =) + q(t) (u(t) + =) + r(t). 


Since u is a solution, this simplifies to 


z' = —(2u(t)p(t) + g(t))z — ple), 
and once again we recover a linear equation in the unknown z. 


Example 10.7 


Consider the Riccati equation 


; », {1—-2¢ 2-1 
y =ty + y+ 


t 
where 
_ bee i? —1 


pi)=t,  g)=—, y=. 


There is a particular solution u(t) = 1. The aforementioned change of variables 
gives 


po. 
2 Ss—— 74, 
t 
solved by 
c—-# 
t — 
It follows the solution is 
3|t| 
y(t) =1+ —— 3) ceER. 


10.3.6 Second-order equations reducible to first order 


When a differential equation of order two does not explicitly contain the dependent 
variable, as in 


aie (10.32) 
the substitution z = y’ leads to the equation of order one 
= 7 (ee) 


in z = z(t). If the latter has a general integral z(t; c1), all solutions of (10.32) arise 
from 
Y=, 


10.3 Equations of first order 439 


hence from the primitives of z(t;c1). This introduces another integration constant 
cg. The general integral of (10.32) is thus 


y(t; C1, C2) = f xltie1) at = Z(te1) + 0a, 


where Z(t;c1) is a particular primitive of z(t;c1). 


Example 10.8 
Solve the second-order equation 
y’ — (PP =1. 
Set z = y’ to obtain the separable ODE 
g= 2741. 
The general integral is arctan z = t+ c1, so 
z(t, c1) = tan(t+c1). 
Integrating again gives 


YE Ci,ts) = [tanlt-+er)at = — log ( cos(t + ¢1)) +2, c1,c2 ER. . 


Consider, at last, a second-order autonomous equation 


y= fiyy’). (10.33) 


We shall indicate a method for each interval J C R where y’(t) has constant sign. 
In such case y(t) is strictly monotone, so invertible; we may consider therefore 
t = t(y) as function of y on J = y(Z), and consequently everything will depend on 
y as well; in particular, z = wy should be thought of as map in y. Then 


yn dz  dzdy_ dz 


= at dy dy” 


whereby y” = f(y, z) becomes 


This is of order one, y being the independent variable and z the unknown. Sup- 
posing we are capable of finding all its solutions z = z(y;c1), the solutions y(t) 
arise from the autonomous equation of order one 


a = AWG 


coming from the definition of t. 
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Example 10.9 


Find the solutions of 


yy" = (y')? 
L€., 
y! 2 
yr ,  (y#0) 
Set f(y, z) = a and solve 
dz 
rT gy, 2) = = 
This gives z(y) = c1y, and then 
dy _ : 
dt = CY, 


whence y(t) = c2e%", co £ 0. 


10.4 The Cauchy problem 


We return to differential equations in their most general form (10.5) and discuss 
the solvability of the Cauchy problem. 

As in Sect. 10.2, let J C R be an open interval and D C R” an open connected 
set. Call Q = I x D C R”*!, and take a map f : 2 — R” defined on 2. Given 
points to in J and yo in D, we examine the Cauchy problem 


(10.34) 


10.4.1 Local existence and uniqueness 


Step one of our study is to determine conditions on f that guarantee the Cauchy 
problem can be solved locally, by which we mean on a neighbourhood of fo. 

It is remarkable that the continuity of f alone, in t and y simultaneously, is 
enough to ensure the existence of a solution, as the next result, known as Peano’s 
Existence Theorem, proves. 


Theorem 10.10 (Peano) Suppose f is continuous on 2. Then there exist 


a closed neighbourhood |tp —a,to +a] of to and a map y: [to —a, to +a] > D, 
differentiable with continuity, that solves the Cauchy problem (10.34). 


There is a way of estimating the size of the interval where the solution exists. 
Let By(to) and B,(yo) be neighbourhoods of to and yo such that the compact set 
K = Ba(to) x Bo(yo) is contained in 2, and set M = ee ll f(t, y)||. Then the 

Lye 


theorem holds with a = min(a, b/M). 
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Peano’s Theorem does not claim anything on the solution’s uniqueness, and in 
fact the mere continuity of f does not prevent to have infinitely many solutions 
to (10.34). 


Example 10.11 
The initial value problem 
y = 3a, 
y(0) =0, 
solvable by separating variables, admits solutions y(t) = 0 (the singular integral) 
and y(t) = V#3; actually, there are infinitely many solutions, some of which 
0 i $< a 
ult) = wal) = 4 (t—a)? ift>a, oe 
are obtained by ‘glueing’ the previous two in a suitable way (Fig. 10.4). 


That is why we need to add further hypotheses in order to grant uniqueness, 
besides requiring the solution depend continuously on the initial datum. 
A very common assumption is this one: 


Definition 10.12 A map f defined on (22= I x D is said Lipschitz in y, 
uniformly in t, over (2, if there exists a constant L > 0 such that 


F(t, yr) — F(E, yo) || < Lllyi — yell, Vy1,y2E€D, Vitel. (10.35) 


Vy 


Figure 10.4. The infinitely many solutions of the Cauchy problem relative to Ex- 
ample 10.11 
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Using Proposition 6.4 for every t € I, it is easy to see that a map is Lipschitz if 
all its components admit bounded y;-derivatives on (2: 


Oy; 


(t,y)EQ 


Examples 10.13 


i) Let us consider the function f(t, y) = y?, defined and continuous on R x R. 
Then Vy1, yo € R and Vt € R, 


If(t,y1) = f(t, ¥2)| = ly? — v2] = lyn + yal lyn — yal - 
Hence f is Lipschitz in the variable y, uniformly in t, on every domain (2 = 
Qr =Rx Dr with Dr = (—R, R), R > 0; the corresponding Lipschitz constant 
L = Lr equals Le = 2R. Note that the function is not Lipschitz on R x R, 
because |y; + y2| tends to +00 as y; and yz tend to infinity with the same sign. 


ii) Consider now the map f(t, y) = sin(ty), defined and continuous on R x R. It 
satisfies 


| sin(ty1) — sin(ty2)| < |(ty1) — (ty2)| = ltl lyr — yal, 
Vyi,y2 € R, Vt © R, because 6 + sin@ is Lipschitz on R with constant 1. 
Therefore f is Lipschitz in y, uniformly in t, on every region 2 = QrR = IpxR, 
with Ip = (-R,R), R > 0, with Lipschitz constant L = Lp = R. Again, f is 
not Lipschitz on the whole R x R. 


iii) Finally, let us consider the affine map f(t, y) = A(t)y + b(t), where A(t) € 
R™”, b(t) € R” are defined and continuous on the open interval J C R. Then f 
is defined and continuous on J x R”. For any yj, yo € R” and any t € J, we have 


F(t, yn) — F(t, y2) || = || Ay — A(t) yell 


= |A@ (an — v2)Il < IAI Ilys — yell, 
where ||A(t)|| is the norm of the matrix A(t) defined in (4.9), and the inequality 
follows from (4.10). Observe that the map a(t) = ||A(t)|| is continuous on J (as 
composite of continuous maps). If a is bounded, f is Lipschitz in y, uniformly 
in t on 2 =I X R, with Lipschitz constant L = sup,<; a(t). If not, we can at 
least say a is bounded on every closed and bounded interval J C I (Weierstrass’ 
Theorem), so f is Lipschitz on every 22 = J x R”. O 


The next result is of paramount importance. It is known as the Theorem of 
Cauchy-Lipschitz (or Picard-Lindelof). 


Theorem 10.14 (Cauchy-Lipschitz) Let f(t, y) be continuous on Q and 
Lipschitz in y, uniformly int. Then the Cauchy problem (10.34) admits one, 


and only one, solution y = y(t) defined on a closed neighbourhood [to — a, to + 
al of to, with values in 2, and differentiable with continuity on 2. 
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Figure 10.5. Local existence and uniqueness of a solution 


The solution’s uniqueness and its dependency upon the initial datum are con- 
sequences of the following property. 


Proposition 10.15 Under the assumptions of the previous theorem, let y, 
z be solutions of the Cauchy problems 


ee 


y(to) = yo, 


over an interval J containing to, for given yo and z) € D. Then 
lye) — z(t)|| <e7*lllyo — zl], Vee J. (10.36) 


In particular, if 29 = yo, z coincides with y on J. 


What the property is saying is that the solution of (10.34) depends in a continuous 
way upon the initial datum yo: a small deformation ¢ of the initial datum perturbs 
the solution at t 4 to by e/!*-*le at most. In other words, the distance between 
two orbits at time t grows by a factor not larger than e/!*—'l, Since this factor 
is exponential, its magnitude depends on the distance |t — to| as well as on the 
Lipschitz constant of f. 

For certain equations it is possible to replace e/!*—ol with e7—*) if t > to, 
or e7(to—*) if t < tg, with o < 0 (see Example 10.22 ii) and Sect. 10.8.1). In these 
cases the solutions move towards one another exponentially. 


Remark 10.16 Assume that a mathematical problem is formalised as 
P(s)=d 
where d is the datum and s the solution. Whenever s 


e exists for any datum d, 
e is unique, and 
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e depends upon d with continuity, 


the problem is said to be well posed (in the sense of Hadamard). 
Theorem 10.14 and the subsequent proposition guarantee the Cauchy prob- 
lem (10.34) is well posed with the assumptions made on f. Oo 


From the Theorem of Cauchy-Lipschitz we infer important consequences of 
qualitative nature regarding the solution set of the differential equation. To be 
precise, its hypotheses imply: 


e two integral curves with a common point must necessarily coincide (by unique- 
ness of the solution on the neighbourhood of every common point); 

e for autonomous systems in particular, the orbits ['(y), as y varies among 
solutions, form a disjoint partition of phase space; it can be proved that an 
orbit is closed if and only if the corresponding solution is periodic in time, 
y(t+T) = y(t), for all t € R and a suitable T > 0; 

e the solutions on 92 of the ODE depend on n real parameters, as anticipated 
in Sect. 10.2: given any tg € J, there is a distinct solution for every choice of 
datum yo on the open set D, and the coordinates c, = yo1,.--,¢n = Yon Of Yo 
should be considered as the free parameters upon which the solution depends. 


10.4.2 Maximal solutions 


The Theorem of Cauchy-Lipschitz ensures the existence of a closed interval [to — 
a,to + a], containing to, where the problem (10.34) is solvable; we shall call u = 
u(t) the corresponding solution henceforth. However, this interval might not be 
the largest interval containing to where the problem can be solved (the maximal 
interval of existence). In fact, suppose t; = to + a lies inside J and y; = u(t;) is 
inside D. Then we can re-ignite the Cauchy problem at ¢, 


{¥ = f(t,y), 
y(ti) =y1- 


Because the assumptions of Theorem 10.14 are still valid, the new problem has a 
unique solution v = v(t) defined on an interval [t; — 6,t; +] C J. Both functions 
u, v solve the ODE and satisfy the condition u(t1) = yi = v(ti) on the interval 
J = [to, ti] N [t1 — 8, t1], which contains a left neighbourhood of t; (see Fig. 10.6). 
Therefore they must solve problem (10.37) on J. We can use inequality (10.36) 
with to replaced by 1, to obtain 


(10.37) 


l|e(t) — v(t) || <e” ly, — yi] =0, 9 Vee J; 


that shows v must coincide with u on J. Since v is also defined on the right of ft, 
we may consider the prolongation 


in [tp —a,to +a], 


in [to +a,to+a+t 6], 
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i= £8 ti +6 


Figure 10.6. Prolongation of a solution beyond ft, 


that extends the previous solution to the interval [to — a,to + a+ 6]. A similar 
prolongation to the left of to — a is possible if to — a is inside J and u(to — q) still 
lies in D. 

The procedure thus described may be iterated so that to prolong u to an open 
interval Jmax = (t_,t+), where, by definition: 


t_ = inf{t, € I: problem (10.34) has a solution on [t., to]} 


and 
t, = sup{t* € I: problem (10.34) has a solution on [to, t*]}. 


This determines precisely the maximal interval on which a solution uw to prob- 
lem (10.34) lives. 

Suppose t is strictly less than the least upper bound of J (so that the solution 
cannot be extended beyond t,, despite f is continuous and Lipschitz in y). Then 


Yo 


> 


7s 4; 7 : 


Figure 10.7. Right limit t, of the maximal interval Jmax for the solution of a Cauchy 
problem 
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one could prove u(t) moves towards the boundary OD as t tends to t+ (from the 
left), see Fig. 10.7. Similar considerations hold for t_. 


Example 10.17 
Consider the Cauchy problem 


we 
y(0) = yo- 


The Lipschitz character of f(t,y) = y? was discussed in Example (10.13) i): 
for any R > 0 we know f is Lipschitz on QR = R x Dr, Dr = (—R, R). The 
problem’s solution is 


Yo 
t)= : 
y(t) f= 


If yo € Dr, it is not hard to check that the maximal interval of existence in Dr, 
i.e., the set of t for which y(t) € Dr, is 


1 1 . 
( OO, vs R? if Yo > 0, 
Jmax = < (—00, +00) if yo = 0, 
ee, if yo <0. 
yo R 
Notice that lim;:, y(t) = R when yo > 0, confirming that the solution reaches 
the boundary of Dr ast > ty. O 


10.4.3 Global existence 


We wish to find conditions on f that force every solution of (10.34) to be defined 
on the whole interval J. A solution for which Jmax = J is said to exist globally, 
and called a global solution. 

The petty example of an autonomous system in dimension 1, 


y’ = fly), 


is already sufficient to show the various possibilities. The map f(t,y) = f(y) = 
ay + 6 is Lipschitz on all R x R, and the corresponding ODE’s solutions 


are defined on R (see Example 10.4 i)). In contrast, f(t,y) = f(y) = y? is not 
Lipschitz on the entire R? (Example 10.13 i)) and its non-zero solutions exist on 
semi-bounded intervals (Example 10.17). The linear example is justified by the 
following sufficient condition for global existence. 


Theorem 10.18 Let QQ = I x R” and assume Theorem 10.14 holds. Then 


the solution of the Cauchy problem (10.34) is defined everywhere on I. 
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This result is coherent with the fact, remarked earlier, that if the supremum t+ 
of Jmax is not the supremum of J, the solution converges to the boundary of D, 
as t > t, (and analogously for t_). In the present situation D = R” has empty 
boundary, and therefore the end-points of Jmax must coincide with those of J. 

Being globally Lipschitz imposes a severe limitation on the behaviour of f as 
y — co: f can grow, but not more than linearly. In fact if we choose y, = y € R” 
arbitrarily, and yo = O in (10.35), we deduce 


F(t,¥) — F(t,0)|| < Lily],  VyeR”. 
The triangle inequality then yields 


F(t, wll = |F@,0) + Fy) — FE, O)I 
< FE, + Fy) — FEO), 


so necessarily 


IFE WI SFG ON + Lllyll, yeR". 


Observe that f continuous on 2 forces the map (A(t) = ||f(t,0)|| to be continuous 
on I. 
Consider the autonomous equation 


y’ = ylog’y, 


for which f(t,y) = f(y) = ylog® y grows slightly more than linearly as y + +00. 
The solutions 
y(t) =e Vere) 


are defined over not all of R, showing that a growth that is at most linear is close 
to being optimal. Still, we can achieve some level of generalisation in this direction. 
We may attain the same result as Theorem 10.18, in fact, imposing that f 
grows in y at most linearly, and weakening the Lipschitz condition on I x R”; 
precisely, it is enough to have f locally Lipschitz (in y) over 2. Equivalently, f 
is Lipschitz in y, uniformly in t, on every compact set K in 92; if so, the Lipschitz 
constant LD. may depend on kK. All maps considered in Examples 10.13 are indeed 
locally Lipschitz (on R x R for the first two examples, on J x R” for the last). 


Theorem 10.19 Let f be continuous and locally Lipschitz in y, uniformly 
in t, over 22 =I x R”; assume 


IFE wll so@Mllyll+6@, VvyeR", Veer. (10.38) 


with a,B continuous and non-negative on I. Then the solution to the Cauchy 
problem (10.34) is defined everywhere on I. 


Moreover, if a and 8 are integrable on I (improperly, possibly), every solution 
is bounded on I. 
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A crucial application regards linear systems. 


Corollary 10.20 Let A(t) € R™”, b(t) € R” be continuous on the open 
interval I CR. Then any solution of the system of ODEs 


y = A(t)y + b(t) 


is defined on the entire I. In particular, the corresponding Cauchy prob- 
lem (10.34) with arbitrary yo € R” admits one, and one only, solution over 
all of I. 


Proof. The map f(t,y) = A(t)y + b(t) is locally Lipschitz on I x R”, by Ex- 
ample 10.13 iii). Following that argument, 


IFG WI SAMMY +OOl, VyeR", veel, 


and a(t) = || A(t)||, G(¢) = ||b(¢)|| are continuous by hypothesis. The claim 
follows from the previous theorem. Oo 


Application: the simple pendulum (II). Let us resume the example of p. 430. 
The map f(y) has 
0 1 
JF(y) = & COS Yq a 
so the Jacobian’s components are uniformly bounded on R?, for 


Ofi - 

FAC] <maxtta), wweR?, 11752 

OY; 

then f is Lipschitz on the whole of R?. Theorems 10.14, 10.18 guarantee existence 
and uniqueness for any (00,01) € R?, and the solution exists for all t > 0. 
(Continues on p. 453.) Oo 


10.4.4 Global existence in the future 


Many concrete applications require an ODE to be solved ‘in the future’, rather 
than ‘in the past’: it is important, namely, to solve for all t > to, and the aim is 
to ensure the solution exists on some interval J bounded by to on the left (e.g., 
[to, +oo)), whereas the solution for t < to is of no interest. The next result cashes 
in on a special feature of f to warrant the global existence of the solution in the 
future to an initial value problem. 

A simple example will help us to understand. The autonomous problem 


o =-y? 
y(0) = ¥0, 
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is determined by f(y) = —y?: this is locally Lipschitz on R, and clearly grows faster 
than a linear map as y — oo; therefore it does not fulfill the previous theorems, 
as the solution a 
iC = ————>, (10.39 

) Vit 2yat ) 
not defined on the entire R, confirms. Yet y exists on [0, +00), whichever the initial 
datum yo. The fact that the solution does not ‘blow up’, as t increases, can be 
derived directly from the differential equation, even without solving. Just multiply 
the ODE by y and observe 


then 


i.e., the quantity E(y(t)) = sly(t |? is non-increasing as t grows. In particular, 


1 1 
SOP < 5lWOIP? = sluol*, ve>o, 


SO 
ly(t)| < |yol, Vt>0. 


At all ‘future’ instants the solution’s absolute value is bounded by the initial value, 
which can also be established from the analytic expression (10.39). 

The argument relies heavily on the sign of f(y) (flipping from —y? to +y? 
invalidates the result); this property is not ‘seen’ by other conditions like (10.35) 
or (10.38), which involve the norm of f(t, y). 

This example is somehow generalised by a condition for global existence in the 
future. 


Proposition 10.21 Let f be continuous and locally Lipschitz in y, uniformly 
ie Geant) — a) x IN ay 


Ue fie) = Ue Vy ER”, Vitel, (10.40) 


the solution of the Cauchy problem (10.34) exists everywhere to the right of 
to, hence on J = IM [to, +00). Moreover, one has 


ly@|l <llyoll,  Veed. (10.41) 


Proof. Dot-multiply the ODE by y: 
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As 


n 


dy; d 
yy = Dap B= 5 Gute salu, 


oe 
ey = = 
7 (glial?) <0 


whence ||y(t)|| < ||yo|| for any t > to where the solution exists. Thus we 
may use the Local Existence Theorem to extend the solution up to the 
right end-point of J. O 


we have 


A few examples will shed light on (10.40). 


Examples 10.22 


i) Let y’ = Ay, with A a real skew-symmetric matrix, in other words A satisfies 
A?’ =—A. Then 


y-fy)=y Ay=0, VWyeR", 
because y- Ay = y? Ay = (Ay)’ y = y' Aly = —y! Ay, so this quantity must 


be zero. Hence 
d 1 9 
= (silul?) = 


i.e., the map E(y(t)) = $||y(t)||? is constant in time. We call it a first integral 
of the differential equation, or an invariant of motion. The differential system is 
called in this case conservative. 


ii) Take y’ = —(Ay+g(y)), with A a symmetric, positive-definite matrix, and 
g(y) of components 


(9(y)), =m), ls<i<n, 
where ¢: R > R is continuous, ¢(0) = 0 and sd(s) > 0, Vs ER (e.g., ¢(s) = 8? 
an in the initial discussion). Recalling (4.18), we have 
y- f(y) =—-(y-Ay+y-g(y)) <—-y- Ay < —rAxlyll? <0, Vy eR”, 
with A, > 0. Setting as above E(y) = 4|ly||?, one has 


d 
Ely(t)) <—rellyOIP, VE> to, 
from which we deduce 
E(y(t)) < e~?**(t) B(yg), = VE > to; 


therefore E(y(t)) decays exponentially at t increases. Equivalently, 
lye) <e +O [yo], t > to, 
and all solutions converge exponentially to 0 as t + +oo. In Sect. 10.8 we shall 


express this by saying the constant solution y = O is asymptotically uniformly 
stable, and speak of a dissipative system. 
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Assuming furthermore ¢ convex, one could prove that two solutions y, z starting 
at Yo, Zo satisfy 
lIy(é) — z@|| se) |yo — zol], -t> to, 
a more accurate inequality than (10.36) for t > to. 


The map y +> E(y) is an example of a Lyapunov function for the differential 
system, relative to the origin. The name denotes any regular map V : B(O) > R, 
where B(O) is a neighbourhood of the origin, satisfying 


e V(O)=0, V(y) >0 on B(O) \ {0}, 


e <—V(u(t) <0 for any solution y of the ODE on B(0). Oo 


10.4.5 First integrals 


Take an autonomous equation y’ = f(y), with Lipschitz f : DCR” > R”. 


Definition 10.23 A scalar map @ : D + R is a first integral of the 
equation if ® is constant along every orbit of the differential equation, 1.e., 


&(y(t)) = constant for every solution y(t) of the ODE. 


A first integral is naturally defined up to an additive constant. 

If the equation has a first integral ®, each orbit will be contained in a level set of 
®. The study of level sets can then provide useful informations about the solution’s 
global existence. For instance, if the orbit of a particular solution belongs to a level 
set that does not touch the boundary of D, we can be sure the solution will exist 
at any time. 

Likewise, the study of the solutions’ asymptotic stability (see Sect. 10.8) could 
profit from what the level sets of a first integral can tell. 


To start with, consider the autonomous equation of order two 
y+ g(y) =9, (10.42) 


where g : R > RisC!. Such kind of equations play a paramount role in Mechanics 
(read, Newton’s law of motion). Another incarnation is equation (10.11) regulating 
a simple pendulum. 


Let JT be an arbitrary primitive of g on R, so that at (y) =g(y), VyER If 
y = y(t) is a solution for t € J then, we can multiply (10.42) by y’, to get 
d (dy\ dy dil dy _ 0 
dt \dt/ dt dy dt —’ 


Lé., 


a (; (y'(t))? + m(y(d)) <i: eg. 
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Thus 


E(y,y') = = (y')? + Hy) 


NO is 


is constant for any solution. One calls [7(y) the potential energy of the mechan- 
ical system described by the equation, and K(y’) = 4(y’)? is the kinetic energy. 
The total energy E(y,y’) = K(y’) + I(y) is therefore preserved during motion 
(the work done equals the gain of kinetic energy). 

The function E turns out to be a first integral, in the sense of Definition 10.23, 
for the autonomous vector equation y’ = f(y) given by 


—_— 
{ ie (10.43) 
Y2 = —g(y1) 5 


equivalent to (10.42) by putting y; = y, yo = y’. In other words @(y) = E(y, ye) = 
U3 + IT(y1), and we may study the level curves of ® in phase space yiy2 to 
understand the solutions’ behaviour. At the end of the section we will see an 
example, the undamped pendulum. 


Generalising this picture we may say that an autonomous equation y’ = f(y) 
over a connected open set D of the plane admits infinitely many first integrals if 
f is the curl of a scalar field ® on D, i.e., f(y) = curl ®(y). 


Property 10.24 Let © be aC! field on D C R?. The equation 


y =curl® 


admits ® as first integral. 


Proof. If y = y(t) is a solution defined for t € J, by the chain rule 


d 
—P(y) = (grad ®) - oF (grad &) - (curl) = 0, 
dt dt 
because gradient and curl are orthogonal in dimension two. Oo 


Bearing in mind Sect. 7.2.1 with regard to level cuves, equation (10.44) forces y 
to move along a level curve of @. 

By Sect. 9.6, a sufficient condition to have f = curl@ is that f is divergence- 
free on a simply connected, open set D. In the case treated at the beginning, 
f(y) = (ye, -g(y1)) = curl E(y1, y2), and div f = 0 over all D = R?. 


To conclude, equation (10.44) extends to dimension 2n, if y = (yi,y2) € 
R” x IR” = R?” is defined by a system of 2n equations 
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(10.45) 


where @ is a map in 2n variables and each partial derivative symbol denotes the n 
components of the gradient of ® with respect to the indicated vectorial variable. 
Such a system is called a Hamiltonian system, and the first integral @ is known 
as a Hamiltonian (function) of the system. An example, that generalises (10.42), 
is provided by the equation of motion of n bodies 


z + grad IT(z) = 0, zéER", (10.46) 


with JJ a function of n variables. Proceeding as for n = 1 we can transform the 
second-order equation into a system of order one like (10.45). The Hamiltonian & 
is the total energy ®(y1, y2) = E(z, 2’) = §||z’||? + H(z) of the system. 

When D = R”, it can be proved that the ODE’s solutions are defined at all 


times if the potential energy is bounded from below. 


Application: the simple pendulum (III). The example continues from p. 448. 
The simple pendulum (a = 0) is governed by 


: 

sll +ksind=0. 
The equation is of type (10.42), with g(@) = ksin6@; therefore the potential energy 
is IT(0) = c—kcos9, with arbitrary constant c. It is certainly bounded from below, 
confirming the solutions’ existence at any instant of time t € R. The customary 
choice c = k makes the potential energy vanish when @ = 27¢, ¢ € Z, in corres- 
pondence with the lowest point S of the bob, and maximises JJ(0) = 2k > 0 for 
0 = (204 1), & € Z, when the highest point IJ is reached. With that choice the 
total energy is 


E(0,0') = ~(6’)? + k(1 — cos8), 


OE 


or 


L 
E(yi,y2) = 502 + k(1 — cos y1) (10.47) 


in the phase space of coordinates y; = 0, y2 = 6’. Let us examine this map on R?: 
it is always > 0 and vanishes at (27,0), € € Z, which are thus absolute minima: 
the pendulum is still in the position S. The gradient VE(y1, y2) = (Ksin y1, y2) is 
zero at (70,0), 2 € Z, so we have additional stationary points ((2¢+ 1)r, 0), LEZ, 
easily seen to be saddle points. The level curves E(y1, y2) = c > 0, defined by 


yo = £\/2(c—k) + 2k cosy, , 


have the following structure (see Fig. 10.8): 
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Y2 
c> 2k 


Figure 10.8. The orbits of a simple pendulum coincide with the level curves of the 
energy E(y1, y2) 


e When c < 2k, they are closed curves encircling the minima of E; they cor- 
respond to periodic oscillations between 9 = —6 and 6 (plus multiples of 
27), where 69 is determined by requiring that all energy c be potential, that is 
k(1 — cos 60) =c. 

e When c = 2k, the curve’s branches connect two saddle points of E’, and corres- 
pond to the limit situation in which the bob reaches the top equilibrium point 
I with zero velocity (in an infinite time). 

e When c > 2k, the curves are neither closed nor bounded; they correspond to 
a minimum non-zero velocity, due to which the pendulum moves past the top 
point and then continues to rotate around O, without stopping. 


(Continues on p. 486.) O) 


10.5 Linear systems of first order 


The next two sections concentrate on vectorial linear equations of order one 
y = A(t)y + W(t), (10.48) 


where A is a map from an interval J of the real line to the vector space R"*” of 
square matrices of order n, while 6 is a function from J to R”. We shall assume A 
and b are continuous in t. Equation (10.48) is shorthand writing for a system of 
n differential equations in n unknown functions 
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yy = air(t)y1 + a12(t)y2 +... + ain(t)yn + b1(€) 
yo = aa1(t)yi + az2(t)y2 +... + Gan(t)yn + b2(t) 


yf, = ani(t)yi + ana(t)ye + ...+@an(t)yn + bn(t). 


Remark 10.25 The concern with linear equations can be ascribed to the fact 
that many mathematical models are regulated by such equations. Models based 
on non-linear equations are often approximated by simpler and more practicable 
linear versions, by the process of linearisation. The latter essentially consists in 
arresting the Taylor expansion of a non-linear map to order one, and disregarding 
the remaining part. 

More concretely, suppose f is aC! map on in the variable y and that g(t), 
t € J CT, is a known solution of the ODE y’ = f(t, y) (e.g., a constant solution 
y(t) = Yo). Then the solutions y(t) that are ‘close to y(t)’ can be approximated 
as follows: for any given t € J, the expansion of y'> f(t, y) around the point y(t) 


reads 
f(t,y) = F(t, 


with g(t, y) = o(y — y( 
with respect to y only. 


y(t) + Jy f(t, y()) (y — u(t) + o(t,y) (10.49) 
(t) )) for y > y(t); Jyf denotes the Jacobian matrix of f 
Set 

A(t) = Jyf (t, 9) 


and note that f(t, y(t)) = y(t), y being a solution. Substituting (10.49) in y’ = 
f(t, y) gives 

(y—y) =Alt)\(y—y)+9lty); 
ignoring the infinitesimal part g then, we can approximate the solution y by setting 
z~y-—y and solving the linear equation 


z’=A(t)z; 


if yo is defined by the initial condition y(to) = yo at to € J, z will be determined 
by the datum z(to) = yo — y(to). Once z is known, the approximation of y(t) is 
y(t) = y(t) + 2(¢). 

In the special case of autonomous systems with a constant solution Yo, the 
matrix A is not time-dependent. O 


We will show in the sequel that equation (10.48) admits exactly n linearly 
independent solutions; as claimed in Sect. 10.2, therefore, the general integral 
depends on n arbitrary constants that may be determined by assigning a Cauchy 
condition at a point to € J. Furthermore, if A does not depend on ft, so that the aj; 
are constants, the general integral can be recovered from the (possibly generalised) 
eigenvalues and eigenvectors of A. 

We begin by tackling the homogeneous case, in other words b = 0. 
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10.5.1 Homogeneous systems 


Consider the homogeneous equation associated to (10.48), namely 
PS, we (10.50) 


and let us prove a key fact. 


Proposition 10.26 The set So of solutions to (10.50) is a vector space of 


dimension n. 


Proof. Since the equation is linear, any linear combination of solutions is still a 
solution, making Sp a vector space. 
To compute its dimension, we shall exhibit an explicit basis. To that end, 
fix an arbitrary point tg € J and consider the n Cauchy problems 


Ae tel, 
ito) = ei, 


where {e;};=1,....n is the canonical basis of R”. Recalling Corollary 10.20, 
each system admits a unique solution y = u;(t) of class Ct on J. The maps 
n 


u;,1 = 1,...,n, are linearly independent: in fact, if y = So aii is the 
i=1 


Tr 
zero map on J, then >. a;u;(t) = 0 for any t € I, so also for t = to 
é=1 


n i) 
a a,uj(to) = 2 a,e; = 0, 
=i i 


whence a; = 0, Vi, proving the linear independence of the u,. 
Eventually, if y = y(t) is an element in So, we write the vector yo = y(to) 
nm 


as Yo = > yoiei; then 
i=l 


n 
y(t) = >> yorui(t), 
i=1 
because both sides solve the equation and agree on to, and the solution is 
unique. Oo 


The set {u;}i=1,....n of maps satisfying 
ee tel, 


ui(to) = e:, (10.51) 
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is an example of a fundamental system of solutions. 


Definition 10.27 Any basis of So is called a fundamental system of solu- 


tions of the ODE (10.50). 


There is a useful way to check whether a set of n solutions to (10.50) is a 
fundamental system. 


Proposition 10.28 Let wj,...,Wn € So. 


a) If at a point to € I the vectors wi(to),...,Wn(to) are linearly independent, 


W1,...,Wn constitute a fundamental system of solutions. 
b) Ifwi,...,Wn form a fundamental system of solutions, at each point to € I 
the vectors wi(to),...,Wn(to) are linearly independent. 


Proof. a) It suffices to show that wi,...,w, are linearly independent. Suppose 
then there exist coefficients c; such that 


> cw; (t) = 0, Vitel. 
t=1 


Choosing t = tg, the hypothesis forces every c; to vanish. 
b) Suppose there exist c1,...,Cn with 


3 ci; wi (to) —1 OF 
a1 


Define the map z(t) = a, c;w;(t); since it solves the Cauchy problem 
i=1 


{ g = Az 

z(to) =0 ) 
by uniqueness we have z(t) = 0 for any t € J. Therefore all c; must be 
Zero. O 


Owing to the proposition, the linear dependence of n vector-valued maps on I 
reduces to the (much easier to verify) linear dependence of n vectors in R”. 


We explain now how a fundamental system permits to find all solutions 
of (10.50) and also to solve the corresponding initial value problem. Given then 
a fundamental system {w1,...,w,}, for any t € I we associate to it the n x n 
matrix 
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W(t) = (wi(t),..-, wn(t)) 


whose columns are the vectors w;(t); by part b) above, the matrix W(t) is 
non-singular since its columns are linearly independent. This matrix is called 
the fundamental matrix, and using it we may write the n vectorial equations 
wi = A(t)w;,i=1,...,n, in the compact form 


W' = A(t)W. (10.52) 


Every solution y = y(t) of (10.50) is represented as 


y(t) = a c;w;(t) 


for suitable constants c; € R. Equivalently, setting c = (c1,...,¢n)", we have 


y(t) = Wit)e. (10.53) 


The solution to the generic Cauchy problem 


= A on I, 
{ ic wa (10.54) 
y(to) = Yo; 
is found by solving the linear system 
Wto) C= Yo. (10.55) 


Therefore we may write the solution, formally, as 
y(t) = W(t)W (to) yo- 


We can simplify this by choosing the fundamental matrix U(t) associated with the 
special basis {u;} defined by (10.51); it arises as solution of the Cauchy problem 
in matrix form 


{ U'=A(t)U on, te 


U(to) =T, 


and allows us to write the solution to (10.54) as 


We put on hold, for the time being, the study of equation (10.50) to discuss 
non-homogeneous systems. We will resume it in Sect. 10.6 under the hypothesis 
that A be constant. 
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10.5.2 Non-homogeneous systems 


Indicate by S; the set of all solutions to (10.48). This set can be characterised 
starting from a particular solution, also known as particular integral. 


Proposition 10.29 Given a solution y, of equation (10.48), Sp is the affine 


SHace Oh — Unt 0; 


Proof. If y € Sy, by linearity y — yp solves the homogeneous equation (10.50), 
hence y — Yp € So. 


The proposition says that for any given fundamental system wy,...,w,, of solu- 
tions to the homogeneous equation, each solution of (10.48) has the form 


for some c = (c1,..-,;€n)? € R”. 


An alternative route to arrive at the general integral of (10.48) is the method of 
variation of constants, already seen in Sect. 10.3.3 for scalar equations. By (10.53) 
we look for a solution of the form 


with c(t) differentiable to be determined. Substituting, we obtain 
W(t) c(t) + W(t) c/(t) = A(t) W(t) c(t) + b(t). 
By (10.52) we arrive at 
Wit) c'(t)= b(t), from which ~~’ (t) = W(t) W(t), 
1.e., 


d= / W(t) b(t) dt, (10.57) 


where the indefinite integral denotes the primitive of the vector W~1(t) b(t), com- 
ponent by component. The general integral of (10.48) is then 


y(t) = Wit) / W(t) b(t) dt. (10.58) 
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Example 10.30 Consider the equation 


1 e~ 
2 —2 42 — 1+? 
v=( _ Jv Vie t ve (10.59) 


Vi-@ 1+8 


A fundamental system of solutions of the homogeneous equation is 


wi(t) = @ . wat) =e"( 11) 


(as we shall explain in the next section). The fundamental matrix and its inverse 
are, thus, 


Consequently ' 


W-lityo(t) = | VIO ® 


1+? 
[wow ees ( arcsint + cj; ) . 


arctant + C9 


sO 


Due to (10.58), the general integral is 
(i ( arcsint + c; + e“*(arctant + c2) ) 
Yb) = . 


O 
arcsint + c, — e*#(arctant + c2) 
Whenever we have to solve the Cauchy problem 
‘= A(t)y+0(t) onT, 
hernia ae 
y(to) = yo, 


it is convenient to write (10.57) as 
t 
e(t) =c+ / W~'(s) b(s) ds. 
to 
with arbitrary c € R”. Equation (10.58) then becomes 
t 
y(t) = Wit)e+ / W(t)W'(s) b(s) ds; 
to 


putting t = to gives y(to) = Wto)c, so c is still determined by solving (10.55). 
Altogether, the solution to the Cauchy problem is 


y(t) = Ynom(t) + yp(t), 
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where Ynom(t) = W(t)W ~1(to)yo solves the homogeneous system (10.54), while 


(10.61) 


solves the non-homogeneous problem with null datum at t = to. 


Example 10.31 Let us solve the Cauchy problem (10.60) for the same equation 
of the previous example, and with y(0) = (4,2)7. The general integral found 


earlier is 
Cy te**e, arcsint + e* arctant 
y(t) = es le ade cdt 
ci — eco arcsint — e’ arctant 
the first bracket represents the solution ynom of the associated homogeneous 


equation, and the second is the particular solution y, vanishing at t = 0. The 
constants c 1, C2 are found by setting t = O and solving the linear system 


(4) (e)=G), 


whence c; = 3 and co = 1. O 


10.6 Linear systems with constant matrix A 


We are ready to resume equation (10.48), under the additional assumption that A 
is independent of time. We wish to explain a procedure for finding a fundamental 
system of solutions for 


y = Ay on J=R. (10.62) 


The method relies on the computation of the eigenvalues of A, the roots of the 
so-called characteristic polynomial of the equation, defined by 


x(A) = det(A — AT), 
and the corresponding eigenvectors (perhaps generalised, and defined in the se- 
quel). 


At the heart of the whole matter lies an essential property: 


if A is an eigenvalue of A with eigenvector v, the map 


w(t) =e*v 


solves equation (10.62) (in C, if A is complex). 
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Indeed, using (10.28) and Av = Av, we have 


Mt 
w'(t) = 7 (e*'v) = << v=drA4c*!v=e“AV=A (e*'v) = Awi(t). 


At this point the treatise needs to split in three parts: first, we describe the 
explicit steps leading to a fundamental system, then explain the procedure by 
means of examples, and at last provide the theoretical backgrounds. 

To simplify the account, the case where A is diagonalisable is kept separate. 
Diagonalisable matrices include prominent cases, like symmetric matrices (whose 
eigenvalues and eigenvectors are real) and more generally normal matrices (see 
Sect. 4.2). Only afterwards we examine the more involved situation of a matrix 
that cannot be diagonalised. 

In the end we illustrate a solving method for non-homogeneous equations, with 
emphasis on special choices of the source term D(t). 


10.6.1 Homogeneous systems with diagonalisable A 


Suppose A has # real eigenvalues \1,..., Ag and n — @ complex eigenvalues which, 
being A real, come in m complex-conjugate pairs Ne41, Ae41; seg AB Nerm} ei- 
genvalues are repeated according to their algebraic multiplicity, hence 0+ 2m = n. 
Each real Ay, 1 < k < £, is associated to a real eigenvector v;, while to each pair 
(Acek; Aetk), 1 < k < m, corresponds a pair (ve14, Ber) Of complex-conjugate 
eigenvectors. As A is diagonalisable, we can assume the eigenvectors are linearly 
independent (over C). 


e For each real eigenvalue Az, 1 < k < £, define the map 


e For each pair (Act, Ark); 1 <k< m, we decompose eigenvalues and eigen- 
vectors into real and imaginary parts: Ag+, = On + iw, Vere = vy) + iv? 


Define maps 


= Re (er+# aps 3) = e7HE (yl) cos wt — y?) sin wht) ; 


(t) =n (era 5) = ert (yl) sin wt + y\”) cos wt) : 


Proposition 10.32 With the above conventions, the set of functions 


Ht Wee ae tle wi w”), ee wy, wy} is a fundamental system of solutions 


to (10.62). 


10.6 Linear systems with constant matrix A 463 


The general integral is then of the form 


L m 
= Ss cpr wr(t) + y+ S- (et (c? w(t +c? w?)(t)) 
k=1 ik 


HUD Gina 2h (ey pasos Ge I 


The above is nothing but the general formula (10.53) adapted to the present situ- 
ation; W(t) is the fundamental matrix associated to the system. 


Examples 10.33 
i) The matrix 


a=(¥ 3) 


admits eigenvalues A, = 1, Ao = —3 and corresponding eigenvectors 


wi) ah eel), 


As a consequence, (10.63) furnishes the general integral of (10.62): 


1 x 1 eye! + cge7 
ee: 3t _ faa 2 
y(t) =cie (;) + €2e (,) = & =e) * 


For a particular solution we can consider the Cauchy problem with initial datum 


y(0) = yo= (*) 


for example, corresponding to the linear system 
Ci +eg =2 
{ Ci -Cg=1. 
Therefore c; = 3/2, cg = 1/2 and the solution reads 
1 (3e° +e°%* 
y(t) = 5 & _ a . 


ii) The matrix 


1 1 O 
A=|0 -1 1 
0 -—-10 5 


has eigenvalues A, = 1, Ay = 2+i and its conjugate Ay = 2—i. The eigenvectors 
are easy to find: 
il i 1 0 1 0 
y= {[O}], vwo=] 1t+2}=t1)r+i7t1 
0 2+ 41 2 4 2 4 


S| 
bo 
| 

eH 
| 
~ 

eH 
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Therefore (10.63) (we write co = cf) and c3 = ce?) for simplicity), tells us that 
the general integral of (10.62) is 


1 | 1 0 | 
y(t) =cie’ | 0] + tac 1 |] cost— | 1] sint; + 
2 


0 4 
1 0 
+c3e : sint + ; cost 
1 0 
=c, [0 }eé+ le 1 e7 cost + 
0 4 
0 
—co| 1) +c3 e* sint 
4 


cye’ + c9e”* cost + cge”" sint 
= (co — c3)e”* cost + (c3 — co)e”’ sint 


(cg — 4c3)e* cost + (2c3 — 4cz)e” sin t 


Explanation. Since eigenvalues and eigenvectors of A may not be real, it is 
convenient to think (10.62) within the complex field. Thus we view y = y(t) asa 
map from R to C”; the real part y,(t) and the imaginary part yj(t) solve the real 
systems 

y, = Ay, and y;, = Ay; onl =R. 


All subsequent operations are intended over C (clearly, complex arithmetic is 
not necessary when A is diagonalisable over R, when all eigenvalues—hence all 
eigenvectors—are real). Call A = diag(\i,..., An) the diagonal matrix with the ei- 
genvalues as entries, and P = (v1,..., Un) the square matrix with the eigenvectors 


as columns. Thus A becomes 
A=PAP". (10.64) 


Let y be an arbitrary complex solution of (10.62); substituting (10.64) in equa- 
tion (10.62), and multiplying by P~! on the left, gives 


(P-'y) = A(P~'y), 


by associativity. Now setting 


(10.65) 
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equation (10.62) becomes diagonal 


coe (10.66) 
In other words we obtain n equations 
Zp, = Anes l<k<n. 
By Remark 10.5 these have solutions 
zp(t) = dpe**, 1<k<n, 


with d, € C arbitrary constants. Using the diagonal matrix 


(10.67) 


and the constant vector d = (dj,..., ie) € C”, we can write the solutions as 
z(t) =e4*d. 


Writing in terms of the unknown y, by the second equation in (10.65), we have 
y(t) = Pe“‘d; (10.68) 


it is of the general form (10.53) with W(t) = Pe‘. Making the columns of this 
matrix explicit, we have 


y(t) = dye™!*v, +--+» + dne*tvn = diwi(t) +++: + dnwn(t), (10.69) 


where 


wr, (t) = ert up , Lh Ws 


The upshot is that every complex solution of equation (10.62) is a linear com- 
bination (with coefficients in C) of the n solutions w;, which therefore are a 
fundamental system in C. 

To represent the real solutions, consider the generic eigenvalue A with eigen- 
vector v. Note that if \ € R (so v € R”), the function w(t) = e*v is a real solution 
of (10.62). Instead, if A\=o0+iw€C,sov= v) + iv © C”, the functions 


w(t) = Re (ev) = a (yl?) cos wt — v) sin wt) , 
w')(t) = Im(ev) = e7!(v sinwt + v®) cos wt) , 


are real solutions of (10.62); this is clear by taking real and imaginary parts of the 
equation and remembering A is real. 

In such a way, retaining the notation of Proposition 10.32, we obtain n real 
solutions 


wi(t),...,we(t) and w(t), w)(#),..., w(t), w(t). 
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There remains to verify they are linearly independent (over R). With the help of 
Proposition 10.28 with tp = 0, it suffices to show v1,..., Ug, vw), vi), aed vs), vi?) 
are linearly independent over R. An easy Linear Algebra exercise will show that 
this fact is a consequence of the linear independence of the eigenvectors U1,..., Un 


over C (actually, it is equivalent). 


Remark 10.34 Imposing the datum y(0) = yo for tp = 0 in (10.68), solution of 
y' = Ay, we find Pd = yo, so d= P~'ypo. The solution of the Cauchy problem 


{ y =Ay onlI=R, 
y(0) = Yo 
can therefore be represented as 


y(t) =e" yo, (10.70) 


where we have used the exponential matrix 


The above formula generalises the expression y(t) = eyo for the solution to the 
Cauchy problem y’ = ay, y(0) = yo - 

The representation (10.70) is valid also when A is not diagonalisable. In that 
case though, the exponential matrix is defined otherwise 


This formula is suggested by the power series (2.24) of the exponential function 
e”; the series converges for any matrix A and for any t € R. Oo 


10.6.2 Homogeneous systems with non-diagonalisable A 


By A1,.-.,Ap we number the distinct eigenvalues of A, while j1; denotes the algeb- 
raic multiplicity of Az, i-e., the multiplicity as root of the characteristic polynomial 
det(A—AI). Then n = fi+...+/p. Call mg < py the geometric multiplicity of Ax, 
that is the maximum number of linearly independent eigenvectors v1, .-.,Uk,m, 
relative to Ax. We remind A is diagonalisable if and only if the algebraic and geo- 
metric multiplicities of every eigenvalue coincide; from now on we suppose Mz < [LE 
for at least one Az. Then it is possible to prove there are dy = wz — Mx vectors 


Tk,l,+++,Tk,d,, called generalised eigenvectors associated to Az, such that the 
[lk VeCtOrs UE1,--+,;UVkjmes Tks +++5Tk,d, are linearly independent. Moreover, the 
collection of eigenvectors and generalised eigenvectors, for k = 1,...,p, builds a 


basis of C”. 
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Let us see how to construct generalised eigenvectors associated to Az. For every 


eigenvector vz,¢, 1 < £ < mx, define ro = = vx,¢ and seek whether a solution of 


(A-ApD)r = ry (10.71) 


exists. The matrix A — A;JI is singular, by definition of eigenvalue. The system 
may not have solutions, in which case the eigenvector v,z,¢ will not furnish gener- 


(?) 


alised eigenvectors. If however the system is consistent, a solution r,; will be a 


generalised eigenvector associated to \;,. Now we substitute ri) 7 to 10) (10.71). 


and repeat the argument. This will produce a cascade of linear systems 
(A-ADr?) =r), (10.72) 


that stops once the system (A—A;I)r = ry h > 0, has no solutions. As a result, 
we obtain gz,¢ = h — 1 generalised eigenvectors. Varying = 1,...,m, will yield 
all the dy = qk +--.-+4k,m, generalised eigenvectors relative to Ax. 

Let us now move to fundamental systems of solutions for (10.62). 
e For every eigenvalue A; (1 < k < p) and associated eigenvector vz¢ (1 < 6 < 

mx), define 

0 
wert) =e ap ¢. 


e If qx. > 0, so if there are generalised eigenvectors ry ) (1 <h < gee), build 


maps 


wl}(0) = (tr) + rfl) =e (tone + HA), 
wo) = 0 (Srl + rk + 22), 


and so on; in general, we set 


Proposition 10.35 With the above notations, the set of functions {wt} : 
Ll<k<pl1<<mp,0<h < que} is a fundamental system of solutions 
over C of (10.62). 
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Hence the general integral reads 


qk ,é 


Ne 


Cre WE ¢ 


with A) EC. 


If we wish to represent real solutions only, observe that Ax, © R implies all 


eigenvectors (proper and generalised) are real, hence also the maps wt) (t), built 


from them, are real. In presence of a complex-conjugate pair Ax, Ay = Ax of 
eigenvalues, the corresponding eigenvectors (generalised or not) will crop up in 
conjugate pairs. For that reason it is enough to replace each pair of complex- 
conjugate functions, defined by those vectors, with their real and imaginary parts. 
The construction of a real fundamental system for (10.62) is thus complete. 


Remark 10.36 We have assumed A not diagonalisable. But actually the above 
recipe works even when the matrix can be diagonalised (when the algebraic mul- 
tiplicity is greater than 1, one cannot know a priori whether a matrix is diagon- 
alisable or not, except in a few cases, e.g., symmetric matrices). If the matrix is 
diagonalisable, systems (10.71) will be inconsistent, rendering (10.73) equivalent 
to the solution of Sect. 10.6.1. 


Examples 10.37 


i) The matrix 


has eigenvalues A; = 5, A2 = 3. The first has algebraic multiplicity u, = 1, and 
vii = (0,1,0)” is the associated eigenvector. The second eigenvalue has algebraic 
multiplicity 2 = 2 and geometric mz = 1. The vector va; = (1,—-1,1)7 is 
associated to Az. By solving (A—3I)r = v2) we obtain a generalised eigenvector 


rsp = (1,—1,0)? which, together with v11, vei, gives a basis of R*. Therefore, 


the general integral of (10.62) is 


0 1 1 1 
y(t) =cye™* | 1 | +eoe%% | -1 | + c3e%4 Jt | -1 } + | -1 
0 1 1 0 
ii) The matrix 
3 0 0 
A=1|0 3 0 
1 0 3 
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has a unique eigenvalue \ = 3 with geometric multiplicity m = 2, in fact v1, = 
(0,1,0)7 and v2 = (0,0,1)7 are linearly independent eigenvectors associated to 
it. The system 


(A -3D)r=v11 
has no solutions, whereas 

(A —3D)r = vi2 
gives, for instance, ry) = (10,0) g 
Hence, the general integral of (10.62) is 


0 0 0 1 
y(t) =cye"* | 1 | +cege | 0 | + ese |e| 0) +10 
0 1 1 0 
iii) Consider 
1 5 O 
A=1{0 1 0O 
4 0 1 


with one eigenvalue \ = 1 and one eigenvector v1; = (0,0,1)? (geometric mul- 
tiplicity m = 1). As the set of all eigenvectors is a basis of R?, equations (10.72) 
will spawn two generalised eigenvectors. In fact 


(A-Dri? =o 
(A-Dr{? =rt? 


are solved by ri) = (1/4,0,0)7 and ro) = (0,1/20,0)?. The general integral 
reads 


0 0 1/4 
y(t) =cye’ | O | +ene’ Jt | O] +] O + 
1 1 0 
E 0 1/4 0 | 
tezse' |-t? | O] +¢{ O | + { 1/20 
2 Ay 0 0} | 


Explanation. All the results about eigenvectors, both proper and generalised, 
can be proved starting from the so-called Jordan canonical form of a matrix, 
whose study goes beyond the reach of the course. 

What we can easily do is account for the linear independence of the maps 


wh) (t) of Proposition 10.35. Taking to = 0 in Proposition 10.28 gives 


h h 
wo) =n 


for any k, £,h, so the result follows from the analogue property for A. 
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10.6.3 Non-homogeneous systems 


In the light of Proposition 10.29, a non-homogeneous equation 
y’ = Ay + b(t) (10.74) 


can be solved by simply finding a particular integral yp. 

The method of variation of parameters, and especially formula (10.61), provides 
us with a general means. However, in many situations the source term b(t) can be 
broken into elementary functions like exponentials, polynomials or trigonometric 
maps. Then a particular integral of the same elementary type is usually easy to 
find. 

Henceforth we shall call a polynomial any function p(t), q(t),... from R to R” 
whose components are real algebraic polynomials; a polynomial has degree m if 
the maximum degree of the components is m, in which case it can be written as 


q(t) = egt™ + yt™ 14+... + emit +em, 


with c; € R” and cp £ O. 
Let us suppose that 


b(t) = e' p(t) (10.75) 


with a € R and p(t) a degree-m polynomial. Then there exists a particular integral 


yp(t) = e*qi(t), (10.76) 


where q(t) is a polynomial of degree 


e mifa is not an eigenvalue of A, 
e m+yp if qa is an eigenvalue of algebraic multiplicity u > 1. 


Borrowing from Physics, one refers to the latter situation as resonance. 

The undetermined coefficients of q(t) are found by substituting (10.76) in equa- 
tion (10.74), simplifying the exponential terms and matching the coefficients of the 
corresponding powers of t. This produces a series of linear systems with matrix 
A—al that allow to determine solutions co, €1,..., Cm+, Starting from the highest 
power of t. In case of resonance, A — aT is singular; then the first system just says 
that co is an eigenvector of A, and the other systems require compatibility condi- 
tions to be solved. Example 10.38 ii) illustrates a situation of this type, detailing 
the computation to be performed. 


If the source term looks like 


b(t) =e“ p(t) coswt or b(t) =e p(t) sinwt , (10.77) 
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with a € R, p(t) a polynomial of degree m, w # 0, then a particular integral will 
be 


yp(t) = e™ (qi(t) cos wt + go(t) sin wt) (10.78) 


with qi(t), q2(t) polynomials of degree 


e mifa+iw is not eigenvalue of A, 
e m+yp if a+ iw is an eigenvalue of algebraic multiplicity p> 1. 


The latter case is once again called of resonance. The polynomials’ undetermined 
coefficients are found as before, preliminarly separating terms with coswt from 
those with sin wt. 
Examples 10.38 

i) Find a particular integral of (10.74), where 


0 -1 0 0 
A=/1 0 0 and b(t) = | t 
i 0 2 a 


Referring to (10.75), a = 0, p(t) = bot? + bit with bo = (0,0,1)7, b; = (0,1, 0)" 
(and by = O). Since a is not a root of the characteristic polynomial y(A) = 
(2 — \)(\? + 1), we look for a particular integral 


Yp(y) = q(t) = cot? + eit + co. 
Substituting in (10.74), we have 
2Cot +e, = Acot? + Act + Aco + bot? + byt 5 


so comparing terms yields the cascade of systems 


Aco = —bo 
Ac, = 2Co = bi 
Aco = Cc). 


These give co = (0,0, -1/2)7, ce; = (—1,0,0)? and cz = (0,1, 0)", and a partic- 
ular integral is then 


0 = 0 = 
y,(t) = 0 |t?7+{ 0 Jt+{1]= 1 
—1/2 0 0 3? 


ii) Consider equation (10.74) with 


a=(3 =) and w(t) =e (4). 


As a = 1 is a root of the characteristic polynomial y(A) = (A—5)(A— 1), we are 
in presence of resonance, so we must try to find a particular integral of the form 


Yp(t) =e'(cot + c1). 
Substituting and simplifying, we have 
Co + Cot + €; = Acot + Ac + bo 
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where bp = (1,0)7, and so 
(A = T)co = 0 
Nene = co — bo. 

The first system yields an eigenvector co = (y, 27)? associated to A = 1, for some 
y € R. This constant y is fixed requiring the second system to be consistent, 
meaning Co — bo is a linear combination of the columns of A — J. But that 
matrix has rank 1 since the columns are linearly dependent, so the condition is 
that co — bo is a multiple of another column, (y— 1,27)? = k(1,1)7; this implies 
y—1=2y, so y = —1. Substituting gives eg = (—1,—-2)7 and ce; = (1,5/2)7, 
for example. In conclusion, a particular integral has the form 


yp(t) = et (<3) t+ @)) =e! ex ; 


iii) Find a particular integral of (10.74) with 


0 -l 0 1 
A=|{1 0 0 and b(t) = | O | sin 2t. 
0 1 -l 0 


Referring to (10.77), now a = 0, p(t) = (1,0,0)7 = po and w = 2. The complex 
number a + iw = 27 is no eigenvalue of A (these are —1, +7), so (10.78) will be 


Yp(t) = qi cos 2t + qe sin 2t 
with qi,q2 € R®. Let us substitute in (10.74) to get 
—2qi sin 2t + 2q2 cos 2t = Aq) cos 2t + Age sin 2t + po sin 2t; 
comparing the corresponding terms we obtain the two linear systems 
{ —2q1 = Age + Po 
2q2 = Aqi, 


solved by qi = (—2/3,0,1/6)7, q2 = (0,—-1/3,—-1/12)?. We conclude that a 
particular integral is given by 


—2/3 0 1 8 cos 2t 
ta(t) = 0 cos2t— | 1/3 | sin2t¢= a5 4 sin 2t 
1/6 1/12 sin 2t — 2 cos 2t 


The superposition principle 

Ultimately, suppose b(t) is the sum of terms like (10.75) or (10.77). By virtue of 
linearity, a particular solution y, will be the sum of particular solutions of the 
single summands: 


if b= bi + bo+...+ bx, and ypx solves y’ = Ay + bd, fork =1,...,K, 


then Yp = Ypi +... + Ypx solves y’ = Ay + b. 
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In fact, 


Yn = Up +--+ Up = (Aypi + b1) +... + (AypK + bx) 
= A(Ypi +... + YpK) + (b1 +... + bx) = Ay, t+ b. 


The property is known as superposition (or linearity) principle. Example 10.42 ii) 
will provide us with a tangible application. 


10.7 Linear scalar equations of order n 


In this section we tackle linear scalar equations 


y™) + ayy) +... + an_i1y' tony = bf), (10.79) 


where n is an integer > 2, the coefficients a1,...,@, are real constants and b is a 
real-valued continuous map defined on the real interval J. We abbreviate by Ly 
the left-hand side; the operator L: y +L y is linear because differentiation is a 
linear operation. Then 


fy = y™ ayy") +... + an-1y' tany =0 (10.80) 


is called the homogeneous equation associated to (10.79). 

The background theory can be deduced from the results on linear systems 
of first-order equations, settled in the previous section. In fact, we remarked in 
Sect. 10.2 that any ODE of order n is equivalent to a system of n differential 
equations of order one. In the case at hand, we set y,;(t) = y°~)(t), 1 <i <n, so 
that (10.79) is equivalent to the linear system 


Yi = Y2 
Y2 = ¥B 
Yn = —Q1Yn — -.. — Gn—1y2 — any + O(E) ; 


the latter may be written as (10.48), and precisely 


y = Ay+O(t), (10.81) 
by putting 
iL 
a 0 0 1 0 
y=|.], A= , bt)= : 
; ee dvd 0 O 1 
Yn —an —An-1 Ae wa te —a1 o(t) 


The first component of y is the solution of (10.79). 
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The homogeneous equation 
Equation (10.80) corresponds to the homogeneous system relative to (10.81). From 
this we recover the following central result. 


Proposition 10.39 i) The set So of solutions to the homogeneous equa- 
tion (10.80) is an n-dimensional vector space. 


it) The set Sp of solutions to (10.79) is the affine space Sp = Yp + So, where 
Yp 18 any solution. 


Proof. i) So is a vector space because (10.80) is a linear constraint. With Propos- 

ition 10.26 in mind, let w 1,...,w, be a fundamental system of solutions 
for y' = Ay. Denote by z; = w;- e; the first components of the vectors, 
which clearly solve (10.80). 
If y € So, the associated vector-valued map y is a linear combination 
of the w,;; in particular its first component is a combination of the 2;. 
Thus So is spanned by those solutions. The claim follows provided we 
show that the functions z1,..., Zn are linearly independent. By successively 
differentiating 


> cn =o, Vite R, 
i=1 


we get, forall l<k<n-1, 
5 on" @ =, VEER. 
i=1 
But zh*) is the (k+1)th component of w;, so we obtain the vector equation 
Si cwi(t)=0, WeER, 
i=1 


whence c; = ... = Cn = 0 by linear independence of the w,. 


ii) The argument is similar to the one used in Proposition 10.29. O 


In order to find a basis for So, the linear system associated to (10.80) is not 
necessary, because one can act directly on the equation: we are looking for solu- 
tions y(t) = e*', with constant, possibly complex. Substituting into (10.80), and 
recalling (10.28), gives 


Le) =(A" GO acd ipa Gave” =0, 


i.e., 
Le“) = x(A)e* =0, 
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where x(A) = A” +a,A"~1+...+4n—-1A+4n is the characteristic polynomial of 
the ODE (10.79). Since e* is always non-zero, y(t) = e*” solves the homogeneous 
equation if and only if A is a root of the characteristic polynomial, that is if and 
only if \ satisfies the characteristic equation 


XN + ara" $F an 1A +n = 0. (10.82) 


Hence, the differential problem is reduced to a purely algebraic question, that the 
Fundamental Theorem of Algebra can handle. We know that (10.82) has p distinct 
solutions \j,...,Ap, 1 <p <n; each root Ax, 1 < k <p, has multiplicity uw, > 1, 
so that overall j11 +...+ Up =n. The characteristic equation’s zeroes are nothing 
but the eigenvalues of A in (10.81), because one could prove 


det(A — AD) = (-1)"x(A). 


This gives directly p distinct solutions 


to (10.80). If p < n, any root Ax, of multiplicity 4, > 1 gives uw, —1 further solutions 


Nee WP) Nr k—1 Ant 
ne he ned eee 


ake 


The n solutions thus found can be proven to be linearly independent. In summary, 
the result reads as follows. 


Proposition 10.40 The functions 
pl =e ep Ol al 


form a basis for the space So of solutions to (10.80). Equivalently, every solu- 
tion of the equation has the form 


Git) =e gu(iers = 
=I 


with qz a polynomial of degree < px, — 1. 


In presence of complex(-conjugate) roots of equation (10.82), the corresponding 
basis functions are complex-valued. But we can find a real basis if we replace the 
pair t’e>*", te with the real and imaginary parts of either of them, for any pair 
of complex-conjugate eigenvalues Ax = o% + Wr, Xr = OK — iwy. That is to say, 


t®e7** cos wyt and te7** sin wypt 


are n linearly independent, real solutions. 
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Examples 10.41 
i) Let us make the previous construction truly explicit for an equation of order 
two, 
y” + ary’ +agy =0. (10.83) 

Let A = a? — 4az be the discriminant of \? + a, + a2 = 0. 
When A > 0, there are two distinct real roots Ay2 = (—a; + VA)/2, so the 
generic solution to (10.83) is 

y(t) = c,e™* +c". (10.84) 
When A = 0, the real root Ay = —a;/2 is double, 41 = 2; so, the generic solution 
reads 


Ait 


y(t) = (c1 + cot) e™*. (10.85) 

Eventually, when A < 0 we have complex-conjugate roots Ay = 0+iw = —a/2+ 
i/|A] and Ag = o — iw = —a/2 —iy/JAl, and the solution to (10.83) is 

y(t) =e” (cy coswt + cosinwt). (10.86) 


ii) Solve the homogeneous equation of fourth order 
yO +y=0. 
The characteristic zeroes, solving \*4 + 1 = 0, are fourth roots of —1, A1,2,3,4 = 


v2 ( 1 +7). Therefore y takes the form 


2 Z 2 2 
y(t) = e(V2/2) *(e1 cos ee + c2 sin v,) +e (¥2/2) “(cs cos ee + c4sin ey ; 


The non-homogeneous equation 
Just as for systems of order one, a particular integral is easy to find when b(t) has 
a special form. For instance, for 


b(t) = e“ p(t) (10.87) 


with a € R, p(t) an algebraic polynomial of degree m with real coefficients, there 
is a particular integral 


Yp(t) = eth g(t) , (10.88) 


where ps > 0 is the multiplicity of a as a zero of x(A) = 0 (with uw = 0 if a is not 
a root, while yz > 1 gives resonance), and q(t) is a polynomial of degree m with 
unknown coefficients. These coefficients are determined by substituting (10.88) 
in (10.79), simplifying the common factor e® and comparing the polynomial func- 
tions in f. 

Take another example, like 


b(t) = e“ p(t) cos wt or b(t) = e p(t) sinwt , (10.89) 
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with a € R, p(t) a real algebraic polynomial of degree m, w 4 0. This gives a 
particular integral 


yp(t) = et" (q(t) coswt + go(t) sinwt) , (10.90 
P 


where py > 0 is the multiplicity of a + iw, and q1,q2 are unknown polynomials of 
degree m to be found as above, i.e., separating the terms in coswt and sinwt. 

The superposition principle is still valid if b(t) is a sum of terms b;,(t) like (10.87) 
or (10.89): a particular integral for (10.79) will be a sum of particular integrals for 
the b; Gap 


Examples 10.42 


i) Determine the general integral of 

y =i —tyse 
The characteristic equation \? — \ — 6 = 0 has roots \; = —2, A2 = 3, so the 
general homogeneous integral is 

Yhom(t) = cye~ 7? + cge**. 

In a situation of resonance between the source term and a component of Ynom, 
the particular integral has to be of type 

Yp(t) = t(at + bye ** 
We differentiate and substitute back in the ODE to obtain 

—10at+2a-—5b=¢ 
(the coefficient of t? on the left is zero, as a consequence of resonance), from 
which —10a = 1 and 2a — 5b = 0, so a = —1/10, b = —1/25. In conclusion, 

y(t) = ( — at — st + ci )e™ + coe. 

ii) Let us find the general integral of 


mw 


y+ y' =cost — 2e*. 
By superposition 
y(t) = Ynom(t) + Yp, (t) — 2Yp2(t) ; 
where Yhom is the general homogeneous integral, y,, and yp, are particular in- 


tegrals relative to sources cost and e®’. 
The characteristic equation A? + 4 = 0 has roots A; = 0, A2,3 = =i, hence 


Yhom(t) = c1 + cg cost +cgsint. 


Taking resonance into account, we want yp, of the form yp, (t) = t(acost+bsint). 


Computing successive derivatives of y,, and substituting them into 
th 


Wil + yp, cost 
gives —2acost — 2bsint = cost, so a = —1/2 and b = 0. Therefore yp, (t) = 


1 
— 5 Cos t. 
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Now we search for yp, of the form y,,(t) = de®’. By differentiating and substi- 
tuting in the equation 


m / Bt 
Yoo + Up, = & y 


we obtain d = 1/30, and then yp, (t) = ye". All-in-all, 


u 1 
y(t) =e. + (co = 5¢) cost + cg sint — = 


is the general integral of the given equation. 


10.8 Stability 


The long-time behaviour of solutions of an ODE is a problem of great theoretical 
and applicative importance. A large class of dynamical systems, i.e., of systems 
whose state depends upon time and that are modelled by one or more differential 
equations, admit solutions at any time ¢ after a given instant tp. The behaviour 
of a particular solution can be very diversified: for instance, after a starting trans- 
ition where it depends strongly on the initial data, it could subsequently converge 
asymptotically to a limit solution, independent of time; it could, instead, present 
a periodic, or quasi-periodic, course, or approach such a configuration; a solution 
could even have an absolutely unpredictable, or chaotic, behaviour in time. 

Perhaps more interesting than the single solution is though the behaviour of 
a family of solutions that differ by slight perturbations either of the initial data, 
or of the ODE. In fact, the far future of one solution might not be representative 
of other solutions that kick off nearby. Often the mathematical model described 
by an ODE is just an approximation of a physically-more-complex system; to 
establish the model’s reliability it is thus fundamental to determine how ‘robust’ 
the information that can be extracted from it is, with respect to the possible errors 
of the model. In the majority of cases moreover, the ODE is solved numerically, and 
the discretised problem will introduce extra perturbations. These and other reasons 
lead us to ask ourselves whether solutions that are initially very close stay close at 
all times, even converge to a limit solution, rather than moving eventually apart 
from one another. These kinds of issues are generically referred to as concerning 
(asymptotic) stability. The results on continuous dependency upon initial data, 
discussed in Sect. 10.4.1 (especially Proposition 10.15), do not answer the question 
satisfactorily. They merely provide information about bounded time intervals: the 
constant e”!*-tol showing up in (10.36) grows exponentially from the instant to. A 
more specific and detailed analysis is needed to properly understand the matter. 

We shall discuss stability exclusively in relationship with stationary solutions; 
the generalisation to periodic orbits and chaotic behaviours is incredibly fascinat- 
ing but cannot be dealt with at present. 


Let us begin with the Cauchy problem 


oe aad t>to, 


y(to) = Yo, ee 
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(t, y(t, yo)) 


(t, Yo) 


Figure 10.9. Lyapunov stability 


and suppose there is a Yo, belonging to D, such that f(t, yo) = 0 for any t > to. 
Then y(t) = Yo, Vt > to, is a constant solution that we shall call stationary solu- 
tion. The point yo is said critical point, stationary point, or equilibrium 
point for the equation. A further hypothesis will be that the solutions to (10.91) 
are defined at all times t > to, whichever the initial datum yo in a suitable neigh- 
bourhood B of Yo; from now on y(t, yo) will denote such a solution. Therefore, it 
makes sense to compare these solutions to Yo over the interval [to, +00). To this 
end, the notions of (Lyapunov) stability and attractive solution are paramount. 


Definition 10.43 The stationary solution yo = y(t, Yo) to problem (10.91) 
is stable if, for any neighbourhood Bz(Yo) of Yo, there exists a neighbourhood 


Bs (Yo) such that 


yo€ BNB.-(yo) => ylt,yo) € Bs(yo) Vt > to. 


This is exemplified in Fig. 10.9. 


Definition 10.44 The stationary solution Yo is called attractive (or an 
attractor) if there exists a neighbourhood B(yo) C B such that 


yo€ Biyo) = jim y(t, yo) = Yo, 
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and uniformly attractive (a uniform attractor) if the above limit is uni- 
form with respect to yo, 1.e., 


lim sup _|ly(t, yo) — Yo) || = 0. 
E> +00 yo € B(Ho) 


The two properties of stability and attractiveness are unrelated. The point Yo is 
uniformly asymptotically stable if it is both stable and uniformly attractive. 


Example 10.45 


The simplest (yet rather meaningful, as we will see) case is the autonomous 
linear problem 
ia >, AER, 
y(to) = Yo: 


whose only stationary solution is jp = 0. The solutions y(t, yo) = e*yo confirm 
that 0 is stable if and only if A < 0 (in this case we may choose 6 = ¢ in the 
definition). Moreover, for \ = 0 the point 0 is not an attractor (all solutions are 
clearly constant), whereas for \ < 0, the point 0 is uniformly attractive (hence 
uniformly asymptotically stable): for example setting B(0) = Bi(0) = (—1,1), 
we have 
sup y(t, yo)| =e’? 3 0 as £ —> +00. 
yo€ B(0) 

As far as stability is concerned, we obtain similar results when  € C (complex- 
valued solutions) provided we replace \ with Re X. 


This example generalises directly to autonomous linear systems, now examined. 


10.8.1 Autonomous linear systems 


Suppose f(t, y) = f(y) = Ay +b has A € R™” and b € R” independent of time. 
A stationary solution of (10.91) corresponds to a solution of the linear system 
Ay = —b (unique if A is non-singular). If Yo is one such solution the variable 
change 2 = y — Yo allows us to study the stability of the zero solution of the 
homogeneous equation z’ = Az. There is so no loss of generality in considering, 
henceforth, the stability of the solution y(t,0) = 0 of the homogeneous problem 


a 0. 


y(to) = Yyo- _ 


From Sect. 10.6, in particular Proposition 10.35, we know every solution y(t, yo) 
is a linear combination of 


w(t) = e™p(t), 
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where A € C is an eigenvalue of A, and p(t) is a vector-valued polynomial in t 
depending on the eigenvector (or one of the eigenvectors) associated to 4. In case 
the algebraic and geometric multiplicities of \ coincide, every p(t) associated to 
it has degree 0, and so is constant; otherwise, there exist polynomials of positive 
degree, which therefore tend to oo as t + +00. 


Consequences: 


e if every eigenvalue of A has negative real part, then all basis elements w(t) 
tend to 0 as t —> +00 (recall e7't® — 0 for t > +00 if o = Red < 0, for any 
8); 

e if all eigenvalues of A have negative or zero real part, and the latter ones 
have coinciding algebraic and geometric multiplicities, all w(t) are bounded on 
[0, +00); 

e if there are eigenvalues of A with positive real part, or zero real part and 
algebraic multiplicity greater than the geometric multiplicity, then some w(t) 
tends to oo as t > +00. 


How this translates in the language of stability is easily said. 


Proposition 10.46 a) The origin y(t,0) = 0 is a stable solution of (10.92) 
if and only if all eigenvalues A of A satisfy Re A < 0, and those with Re X = 0 


have the same algebraic and geometric multiplicity. 
b) The origin is a uniformly attractive solution (hence, uniformly asymptot- 
ically stable) if and only if all eigenvalues of A satisfy RerA < 0. 


We shall investigate now all the scenarios for systems of two equations. 


10.8.2 Two-dimensional systems 


a= (0 4) 


has determinant detA = ad — bc and trace trA = a+ d. Its eigenvalues are roots 
of the characteristic polynomial y(A) = A? — trA + detA, so 


trA + (trA)? —AdetA 
inn, nn 
If (trA)? 4 4detA, then the eigenvalues are distinct, necessarily simple, and the 
matrix is diagonalisable. If, instead, (trA)? = 4detA, the double eigenvalue \ = 
trA/2 has geometric multiplicity 2 if and only if b =c=0, ie., A is diagonal; if 
the multiplicity is one, A is not diagonalisable. 


The generic 2 x 2 matrix 


A 


Let us assume first A is diagonalisable. Then Proposition 10.32 tells us each 
solution of (10.92) can be written as 


y(t, Yo) = 21(t)v1 + z2(t)v2, (10.93) 
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where v1, U2 are linearly independent vectors (in fact, they are precisely the ei- 
genvectors if the eigenvalues are real, or two vectors manufactured from a complex 
eigenvector’s real and imaginary parts if the eigenvalues are complex-conjugate); 
zi(t) and z(t) denote real maps satisfying, in particular, z1(0)v1 + z2(0)ve = yo. 

To understand properly the possible asymptotic situations, it proves useful to 
draw a phase portrait, i.e., a representation of the orbits 


P(y) = {y(t, yo) = (yi(t), ya(t)) + t 2 OF 


on the phase plane R? of coordinates y;, yz. We will see straight away that it is 
possible to eliminate t and obtain an explicit functional relationship between y1 
and y2. Actually, it will be better to perform such operation on the variables z1 
and zg first, represent the orbit I\(z) = {(z1(t), z2(t)) : t > 0} in the phase plane 
2122, and then pass to the plane y;y2 using the linear transformation (10.93). 


We have to distinguish six cases. 


i) Two real non-zero eigenvalues with equal sign: Ax < Ay <0 or 0< ry < Ag. 


Then 
zy (t) = ie ; Z(t) = dye?" ; 


with d,, dz dependent on yo. If dj = 0 or dz = 0, the orbits lie on the coordinate 
axes 21 = 0 or zo = O. If neither is zero, 


A2 
21(t) \ 41 
z(t) =f (e*"") x7 _ do ( 1( ’) 
dy 
so the orbits are graphs of 
Laz, with a > 1 
22 4 Y2 4 
NZ 
en = > , a yt 

IN 


Figure 10.10. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case A2 < A1 < 0 (node) 
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Figure 10.11. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case Az < A1 = 0 


(which are half-lines if 4; = 2.) Points on the orbits move in time towards the 
origin, which is thus uniformly asymptotically stable, if the eigenvalues are negative 
(Fig. 10.10, left); the orbits leave the origin if the eigenvalues are positive. On the 
plane y1y2 the corresponding orbits are shown by Fig. 10.10, right, where we took 
U= (3.1); v2 = (—1,1). 

The origin is called a node, and is stable or unstable according to the eigen- 
values’ signs. 


ii) Two real eigenvalues, at least one of which is zero: 

Ag <A, =0 or O= Aq < Ao. 
The function z;(t) = d, is constant. Therefore if Ag # 0, the orbits are vertical 
half-lines, oriented towards the axis z2 = 0 if A2 < 0, the other way if Ao > 0 
(Fig. 10.11). If A2 = 0, the matrix A is null (being diagonalisable), hence all orbits 
are constant. Either way, the origin is a stable equilibrium point, but not attractive. 
iii) Two real eigenvalues with opposite signs: 42 <0 < A1. 


The z,Z2-orbits are the four semi-axes, together with the curves 
lo = 027 with a < 0 


(hyperbolas, if a = —1), oriented as in Fig. 10.12. 
The origin, called a saddle point, is neither stable nor attractive. 


iv) Two purely-imaginary eigenvalues: X= tiw,w £0. 
As 
z1(t) = d1 cos(wt + de) and zo(t) = di sin(wt + dz), 


the orbits are concentric circles on the phase plane z12z2, and concentric ellipses on 


yiy2 (Fig. 10.13). 
The origin is called a centre, and is stable but not attractive. 
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Y 
< ° > ae i 
uN 


Figure 10.12. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case Az < 0 < A; (saddle point) 


v) Complez-conjugate eigenvalues, with non-zero real part: X =o + iw, with 
weDands <0 ore > 0. 
From 


21(t) = de” cos(wt + dz) and z(t) = dye“ sin(wt + dz), 


we see the orbits spiralling towards the origin if o < 0, and moving outward if 
o > 0 (Fig. 10.14). In the former case the origin is uniformly asymptotically stable. 
The origin is a focus, stable or unstable according to the sign of o. 
The last case occurs for non-diagonalisable A, in other words if there is a double 
eigenvalue with geometric multiplicity equal 1. Then v1 is the unique eigenvector 
in (10.93), and v2 is the associated generalised eigenvector, see (10.71). 


a2 4 Y2 4 


Figure 10.13. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case \ = +w (centre) 
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Z1 Y1 


Figure 10.14. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case \ = o + w (focus) 


vt) One real double eigenvalue , of geometric multiplicity 1. 


We have 
z(t) — ie ; z(t) = e?# (dy ++ dt) : 


When A = 0, and d; ¥ 0, the orbits are vertical straight lines as shown in Fig. 10.15; 
for d, = 0 the orbits are fixed points on z; = 0. Therefore, the origin is neither 
stable, nor attractive. 


When \ + 0 instead, ¢t can be written as t = 4 log 


By , whence 


21 
dy 


24> oh 2. P5 


21 
dy 


-2 4 
VVVY FAK A 
5 > 
Z1 
VVVY LAR 4 


Figure 10.15. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case Ai = A2 = 0, bc £0 
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22 Yy2 
ry 


Z1 Y1 


Figure 10.16. Phase portrait for y’ = Ay in the 21 z2-plane (left) and in the yi y2-plane 
(right): the case A1 = A2 # 0, bc £ 0 (improper node) 


Figure 10.16 shows the orbits oriented to, or from, the origin according to whether 
A <0, A > 0. The origin is uniformly asymptotically stable, and still called (de- 
generate, or improper) node; again, it is stable or unstable depending on the 
eigenvalue sign. 


Application: the simple pendulum (IV). The example continues from p. 454. 
The dynamical system y’ = f(y) has an infinity of equilibrium points yo, because 


The solutions 0(t) = é7, with @ an even number, correspond to a vertical rod, with 
P in the lowest point S (Fig. 10.3); when @ is odd the bob is in the highest position 
I. It is physically self-evident that moving P from S a little will make the bob 
swing back to S; on the contrary, the smallest nudge to the bob placed in J will 
make it move away and never return, at any future moment. This is precisely the 
meaning of a stable point S and an unstable point I. 

The discussion of Sect. 10.8.2 renders this intuitive idea precise. Consider a 
simplified model for the pendulum, obtained by linearising equation (10.12) around 
an equilibrium position. 

On a neighbourhood of the point @ = 0 we have sin@ ~ 0, so equation (10.12) 
can be replaced by 


d?6 dé 

— —+kéd=0 10.94 

a de ; ud 
which describes small oscillations around the equilibrium S$; the value a = 0 


gives the equation of harmonic motion. The corresponding solution to the Cauchy 
problem (10.13) is easy to find by referring to Example 10.4. Equivalently, the 
initial value problem assumes the form (10.92) with 


A= @ “) = Jf(0,0), 
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= a —Ak 


whose eigenvalues are \ = . Owing to Sect. 10.8.2, we have that 


if a? > 4k, the origin Yo = (0,0) is a uniformly asymptotically stable node 
[cases 7) or vi)] ; 


if 0 < a? < 4k, the origin is a uniformly asymptotically stable focus [case 


v)|; 


if a = 0, the origin is a centre [case iv)] . 


Whatever the case, the bottom position S' is always stable. 
Linearising (10.12) around the equilibrium 6 = 7, and changing variables y = 
6 — x (so that sin? = — sin y) produces a problem of the form (10.92), with 


A= (; -) = Ff(n,0). 


Ree oe: 
; =e + 4k — ’ 

The eigenvalues \ = Sa are always non-zero and of distinct sign. The 

point Yo = (7,0) is thus a saddle [case iii)], and so unstable. This substantiates 

the claim that J is an unstable equilibrium. 

The discussion will end on p. 488. Oo 


10.8.3 Non-linear stability: an overview 


Certain stability features of linear systems are inherited by non-linear systems, 
thought of as deformations of linear ones. Suppose the function f(t, y) appearing 
in (10.91) has the form 


fltiy) =Aytglt,y), (10.95) 
with g continuous and such that 


g(t, y) = o(|ly||) as yO uniformly in t; (10.96) 


this means there exists a continuous map @ : R, — Ry such that ¢(s) > 0, 
s— 0, and 


lg wil < o(lyDiyll,  Vye BO), Vt>to, 


on a neighbourhood B(O) of the origin. Then, the origin is an equilibrium for 
equation (10.91). What can we say about its asymptotic stability? One answer is 
given by the following fact. 


Theorem 10.47 Let f be defined by (10.95), with g as in (10.96). 
a) If all eigenvalues of A have strictly negative real part, the origin is a 


uniformly asymptotically stable equilibrium for (10.91). 
b) If there is an eigenvalue of A with strictly positive real part, the origin is 
unstable. 


488 10 Ordinary differential equations 


Under the given hypotheses the properties of the non-linear system y’ = Ay + 
g(t, y) are the same of the corresponding linear one y’ = Ay; the latter is nothing 
but the linearisation, around the origin, of the former (compare Remark 10.25). 
We cannot say much more if all eigenvalues have non-positive real parts and some 
are purely imaginary: stability in this case depends upon other properties of g. 

Yet, the theorem has an important consequence for autonomous systems 
around an equilibrium Yo. The criterion is known as Principle of linearised 
stability. 


Corollary 10.48 Let f : D C R” > R” be aC! map, Yo € D such that 
f(Yo) =90, and A= Jf(Yo) the Jacobian of f at Yo. Then the autonomous 
system y’ = f(y) has the following stability properties. 


a) If all eigenvalues of A have negative real part, Yo is a uniformly asymp- 
totically stable equilibrium for (10.91). 
b) If there is an eigenvalue of A with positive real part, Yo is unstable. 


Proof. Using the Taylor expansion (5.16) at Yo, 


f(y) = F(Yo) + JF (Yo)(y — Yo) + g(y) = A(y — Yo) + gy), 


with g(y) = o(|ly — Yoll) as y > Yo. The variable change z = y — Yo 
puts us in the hypotheses of the previous theorem, thus concluding the 
proof. Oo 


These two results are local in nature. Global information concerning the stabil- 
ity of a stationary point can be obtained, for autonomous systems, from the know- 
ledge of a first integral (Sect. 10.4.5), or a Lyapunov function (Example 10.22 ii)). 

A conservative system corresponding to an equation like (10.42) or (10.46) 
admits a first integral, namely the total energy E; therefore, by a theorem due 
to Lagrange, we can say that if the origin is a strict minimum for the potential 
energy II, it must be a stable equilibrium for the system. 

The same conclusion follows if the origin is stationary for a system admitting a 
Lyapunov function V. Furthermore, if the derivative of V along any trajectory is 
strictly negative (with the exception of the origin), then the origin is an attractor. 


Application: the simple pendulum (V). The results about the linearised equa- 
tion (10.94) determine the stability of the stationary points of (10.12) in presence 
of damping. As a consequence of linearised stability (Corollary 10.48), if a > 0 the 
matrix J f(yo) = A has two eigenvalues with negative real part for yo = (0,0), and 
one eigenvalue with positive real part for Yo = (a, 0). As in the linearised problem, 
the bottom equilibrium point S is uniformly asymptotically stable, whereas the 
top point J is unstable. Figure 10.17 zooms on the phase portrait. Multiplying 
equation (10.12) by $8 gives 


d do\” 
= —_— — eS < 
E08") a(Z) so. 
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Figure 10.17. Trajectories in the phase plane for the dampled pendulum (restricted to 
lyi| < a). The shaded region is the basin of attraction to the origin. The points lying 
immediately above (resp. below) it, between two orbits, are attracted to the stationary 
point (27,0) ((—27,0)), and so on. 


making the energy a Lyapunov function, which thus decreases along the orbits 
(compare with Fig. 10.8). 

For a free undamped motion, equilibria behave in the same way in the linear and 
non-linear problems; this, though, can be proved by other methods. In particular, 
the origin’s stability follows from the the fact that it minimises the potential energy, 
by Lagrange’s Theorem. Oo 


10.9 Exercises 


1. Determine the general integral of the following separable ODEs: 


2 
f= oy 2) = oS Toei 
2. Tell what the general integral of the following homogeneous equations is: 
At?y! = y? + 6ty — 30? b) #2y/ — yet/Y = ty 
3. Integrate the following linear differential equations: 


490 10 Ordinary differential equations 


Find the general integral of the Bernoulli equations: 
iL 


1 ¢ 
a)y = =y-y’ b) y’ = =y+-logt 
t t Yy 


Determine the particular integral of the ODE 


, Loe? 
| 


subject to the condition y(0) = 1. 
Establish if there are solutions to 
y =—-2yt+e 7 
with null derivative at the origin. 
7. Solve, on [¥/e, +00), the Cauchy problem 


ey’ = 4t? logt(1 + e¥) 
y(e) = 0. 


8. Solve on the interval (—2, 2) the initial value problem 


9. Given the ODE 
yo: TT 
y sin 2t — 2(y+cost) = 0, te (0. =): 
determine the general integral and write a solution that stays bounded as 


ton. 


Solve the Cauchy problem 
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11. Solve the Cauchy problem 


13. Determine the particular integral of the differential equation 
yy” —(y')? = y? logy 
such that y(0) = y'(0) = 1. 
As a varies in R, solve the differential equation 
y’ = (2+a)y — 2e* 
with initial datum y(0) = 3. 
Let a, b be real numbers. Solve 


poe 


y(2) =1 
on the interval [2, +00). 


Given the ODE 
y (t) = —3ty(t) + kt 


depending on the real number k, find the solution vanishing at the origin. 


17. Given , 
,_ y ~2y—3 
2(14+ 4t) ’ 
a) determine its general integral; 
b) find the particular integral yo(t) satisfying yo(0) = 1; 


A491 


Given the ODE y’ = f(t,y) = /t+y, determine open sets 2 = I x D = 


(a,6) x (7,6) inside the domain of f, for which Theorem 10.14 is valid. 
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Find the maximal interval of existence for the autonomous equation 


y' = fly) 
with: 
a) f(y) = V/1lt+yi+yeitarctan(y: + y2)j on R? 
2 
sin ||y 7 
b) f(y) = ll y 


~ log?(2-+ Ilyll?) 
Verify that the autonomous equation y' = f(y), with 
Y2 : Y1 ; 
a = ————_i- —— 
) FO) = Taye Tey aye? 
b) f(y) = (4ytys + 2yry2)é — (2yys + y3)I 


admits on R? a first integral, then compute it. 


21. Determine the general integral of the system y’ = Ay, where: 


) A=(5 75) b 4=(4 4) 


13: 0: —4 a 
c)| A= 15 DB “25 d)|A= 32 -6 
30 0 +8 Lo 7 
0 1 0 4 0 0 
ec) A=]0 0 1 f)|A=[0 -2 9 
0 =—10. <6 0 4 -2 
3.0 0 1 ‘5 0 
2) A=[(0 3 0 h) A=/0 1 0 
1 3 401 
22. Determine a particular integral of the systems: 
0 =1 0 0 
a) y=[{[1 0 O)}yt{t 
1 0 2 i 
=i =i 0 1 
b) y ={1/8 O -1l]y+ {1]e” 
0 1/8 -1 0 


0 
c) y ={1 0 O } yt] 0 | sin2¢ 
0 1 -!1 0 
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23. Solve ; 
Yi = 2y1 + ¥3 
Yo = YB 
ys = 8y1 


with constraints y1(0) = y2(0) = 1 and y3(0) = 0. 
Find the solutions of 
{ Yi + Yo = Bye 
3y1 — 245 = 5y1 
with initial data y1(0) = 2, yo(0) = —1. 


Determine, in function of the real number b, the solutions of 
p-. f=). 8 


26. As the real parameter a varies, find the general integral of 


with y(0) = (1,1)?. 


Id 0 1 
y =|[a-2 3a-1 1-aly. 
0 0 a 


Solve the system 
{ ge =IWe-—y 
y” = —10x + 5y — 2y’. 


28. Write the general integral for the following linear equations of order two: 


a) y” + 3y' +2y=t2 +1 y” — dy’ + dy = &2* 
y’ +y = 3cost d) y” —3y'+2y=e° 
y" —9y =e * f) y” —2y' — 3y =sint 


29. Solve the Cauchy problems: 
y” +2y'+5y=0 y”’ —5y' +4y = 2t+1 
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30. Integrate the following linear ODEs of order n: 
yl" ze yl" —~2y =0 b) yl" = 2y"" a y/ =i 


y4) — 5a!" + Ty" — 5y’ + Gy = sint 
Determine the general integral of y" + y’ — 6y = e* in function of the real k. 


32. Determine the general integral of the differential equation y" — 2y'’+(1+k)y = 
0, as k varies in R. 


Determine the general integral of 
y!" — Qy" + 49y’ — 98y = 48 sint + (6? + 49)e%* 
for every 8 inR. 


Eh 


34. Determine the general integral of the ODE y’" + 9ay' = cos3t in function of 


acR. 


Discuss the stability of the origin in R? for the equation 


y = Ay with A={0 -1 0 


Study the stability of yo = (—3,1) for the following equation in R?: 


y =f(y) where f(y) = (8y1y2 — 2y5 + 11)é t+ (yr + 3y0)7. 


10.9.1 Solutions 


1. ODEs with separable variables: 


a) The map h(y) = y has a zero at y = 0, which is thus a singular integral. 
Suppose y # 0 and separate variables, so that 


1 t#+2 ct? 
Sdy= | — = dais — SoS, 
i lacs + ed) 


Passing to exponentials, 


t? 


= y(t) = c —— 0. 
y=ui)=er >, eF 


The singular integral y = 0 is obtained by putting c = 0 in the general formula. 
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1 + clog” t 
1 —clog?t’ 


cER. 


c) y= y(t) 


2. Homogeneous ODEs: 
a) Supposing ¢t 4 0 and dividing by 4¢? gives 


Substitute z = 4, so that y’ = z + tz’ and then 


Sia pee 
4 2 A 


Atz’ = (z-—1)(z +8). 
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Since y(z) = (z — 1)(z + 3) vanishes at z = 1 and z = —3, we get y = t and 
y = —3t as singular integrals. For the general integral we separate the variables, 


| opera’ [7% 


Then exponentiating 


log 


—1 
<5 | = tog cl, c>0 


and solving for z, we have 


1+ 3ct 
oe cER, 
Lc 


in which the singular integral z = 1 is included. Altogether, the general integral 


of the equation is 
_ t+ 3c? 


1—cat 


; cER. 


t 


a TEER TIE 20s 
log log c|t| 7 


b) y=y= 


3. Linear equations: 


a) Formula (10.24), with a(t) = $ and b(t) = —34, gives 


3t +2 3 2 
yaol tat festa (22) di = 4 ae Pick, cER. 


b) y=2tarctant+ct, ceER. 
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4. Bernoulli equations: 


a) In the notation of Sect. 10.3.4, 


and set z = z(t) = yy? = 5; 


| 
et 
> 
o) 
5 
X 
| 


— sry! and the equation reads z’ = 


1 . 
1 — £z. Solving for z, 


Therefore, 


to which we have to add y(t) = 0. 


b) We have 
1 
p(t) = tlogt, q(t) = - a=-l. 


t ? 
Set z = y”, so z’ = 2yy’ and the equation reads 


pb. -2 
2 = zu t a lost. 


Integrating the linear equation in z thus obtained, we have 


z= 2(t) = t (log? t+ 0), ceER. 


y = y(t) = +ty/log? t+, cER. 


5. The ODE is separable. The constant solution y = 0 is not valid because it fails 
the initial condition y(0) = 1. By separating variables we get 


1 1 
ae ae 
l= y lx a 


1 
log |1 — e#| = 5 log |2t+ I] +c, ceER. 


Therefore, 


Then 


Solving for y, and noticing that y = 0 corresponds to c = 0, we obtain the general 


integral: 
168 (1 — ct +1) - ceR 
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The datum y(0) = 1 forces c= 1 —e, so 
y = log (1+ (e-1)V/Rt+ 1). 
6. The equation’s general integral reads 
ya er IPH fol Mer dt =o M(t-+0), ceER. 


The condition is y’(0) = 0. Setting 2 = 0 in y(t) = —2y(t) +e~*! gives the new 
condition y(0) = 4, from which c = 4. Thus, 


1 

—2t 
— t+—). 
y=e (++ 5) 


8 sint — 1 
i,yg=1 (2 “(est-4) — 1). 8. y = ————— > . 9. y= ——. 
ae . : (4 — t?)3/2 ¢ cost 

10. The equation is 
t 1 t 
” ri 2 ide i.e. i -a pee 
yy =y ey Ld gY+ ap 


It can be made into a Bernoulli equation by taking 


pit)=s5 q(t) = = a=—2. 


Set z = z(t) = y3, so that 2’ = 3y?y’ and 2’ = 32+ 3t. Solving for z, 


z= 2(t) = ce8/%* 4 - = 


2 
y = y(t) = {/ce8/2)t — ¢ — =: 


The initial condition y(0) = 1 finally gives c = 5/3, so 


Therefore 


5 2 
— t — > _ (3/2)t_ _4-—L. 
y = y(t) 3¢ 3 


1 1 2 
ll. y=y(t) = -P+—-— =. 
J eer ear ae 
12. This second-order equation can be reduced to first order by y’ = z(y), and 
14 
observing 2’ = —-;. Then z = 2z(y) satisfies = -3 + c,. Using the initial 
ZY 


conditions plus y’ = z(y) must give 


(-V3)? =-1l+eq ie, c =4. 


498 10 Ordinary differential equations 


Therefore 4 9 
a sy? —1) hence z=--v/y?-1 
Y y 

(the minus sign is due to y'(0) = —V/3). Now the equation is separable 

2 

y =—-vVy?-1 
y 
Solving 


Jy? —-1 = —2t+ co 
with y(0) = 2 produces cz = V3, and then 


y = y(t) = /1+ (V3 — 2t)?. 


By=yo=™. 


14. The ODE is linear, and the general integral is straightforward 


y= eJ 2+e) dt [or form a(—aer dit = aa gl 4 ce) ceR 


From y(0) = 3 we find 3 = 1+ c, so c = 2. The solution is thus 
y =e" (1 + 2e**). 
15. Directly from the formula for linear ODEs, 
yaertit (3 fers tae at) te (3 [#-*ar) 


3 b—atl 
a EE a f —_ —] 
t (; ; i? te) ifb-—a Az —-l, 


t® (3 logt + c) ifb-a=-—l, 
3 b ak . 
——_—}" a fb— —l 
=< b-a+l1l me 7 ae 
3t° log t + ct® ifb-—az=-—l. 
Now, y(2) = 1 imposes 
3 bt 4 pga 2] ifb-—a#é-1, 
b-—a+l1 
3:2°log2+¢2*=1 ifb-—a=-—l, 
sO 


e=2°° (1-3) ifb—aA~—1, 
—a 


c=2°-°— 3log2 ifb-—a=-—l. 
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The solution is 


3 : 3 ; 

a OE ge fh = 
ja0=enl ( b—-atl se Alles 

3t* logt + (2~% — 3log2) t? ifb-—a=-—1. 


16. The integral of this linear equation is 
ya orbs tat f 8S tat pg dt =F tee ¥, ceER. 
Condition y(0) = 0 implies c = —§. Therefore 


1¥(-e¥). 


17. Solving the ODE y! = 4523: 


2(1+4t) ° 
1—c/[1 + 4¢]’ 

b) yo(t) = 3- viit 4] ee. 
1+ /j1+4¢| 


18. We have dom f = {(t,y) € R?:t+y> 0}. As 


c € R, plus the constant solution y(t) = —1. 


i 1 
Vi (Sas) 


f is Lipschitz in y on every 2 where 5 vee is bounded. Since 


1 1 
sup ————— 
t€(a,8), ye(y,d)2VETY 2/atry 
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the condition holds for any real a, y such that a+~y > 0. The end-points 6, 6 are 


completely free, and may also be +00. 


19. Global solutions: 
a) We have 


Of. _ Yj O fe i 


=~ = ‘ a ia aCe ee 
dy, lt+yi +3 Oy; 14+(y1 +42)? 


Of; 
OY; 
on the whole J = R. 


so | 


| <1 on R?. Hence f is Lipschitz on D = R?, and every solution exists 
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b) The map f is certainly differentiable on R", with continuous partial derivatives 
(as composites of C! elementary functions; besides, the denominator is never 
zero). Partial derivatives are thus bounded by Weierstrass’ Theorem, making 
f locally Lipschitz on R”. Moreover, 


ik 


IFW < Toga): Ill Vy € R”. 


Therefore Theorem 10.19 holds, and any solution exists on J = R. 


20. Existence of first integrals: 
a) As y- f(y) = 0 the equation is conservative, so &(y) = $\|y||? is a first integral 
(Example 10.22 i)). 


b) Since div f(y) = 8y1y3 + 2y2 — (8y1y3 + 2y2) = 0 on R?, f is a curl, and the 
equation admits a first integral @ such that curl = f. Then 


OP 
oe =4y?y3+2yiyo, —-—— = —(2yyg+y3); 
Y2 1 


integrating the first gives 
P(y) = yiys + yiys + c(y1), 
and using the second we find c(y;) = constant. Hence a family of first integrals 
is 
P(y) = ytya + yyg te. 


21. General integrals: 


1 1 
et bt 
a) y(t) =cie (5) ee Ge 
_ _ 3¢ ( cos4t 3r ( —sin4t 
ye) — ete Gea eee ( cos 4t ) , 
c) The matrix A has eigenvalues A; = 1, A2 
eigenvectors 


= 2, As = 3 corresponding to 


v1 = (1,0,3)7, vg = (0,1,0)7 , = (2.5.5)" « 
The general integral is thus of type 


1 0 2 
y(t) =cye’ | 0 | tege7* | 1 | +c3e | 5 
3 0 5 


d) 
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A has eigenvalues A; = 3, A2,3 = 2 + 32 with eigenvectors 


v, = (2,0,1)7, v2.3 = (1,+i,1)". 
Thus 
2 cos 3t sin 3t 
y(t) = cye** | 0 | +cge7* | —sin3t | + ce” | cos3t 
1 cos 3t sin 3t 
is the general integral. 
1 cost —sint 
y(t) =c, | O} + coe ** | —3cost—sint | +c3e7* | —cost + 3sint 
0 8cost+6sint 6cost — 8sint 


A has eigenvalues A; = 4 with multiplicity 2, and Ag = —8. The eigenvectors 
of Ay are v\) = (1,0,0)7 and v\?) = (0,3,2)", while the eigenvector corres- 


ponding to Az is v2 = (0,3, —2)”. Therefore 
0 1 0 
y(t)=cqe | 3 | +cect4 | 0 | + c3e% | 3 
—2 0 2 


The matrix A has one eigenvalue \ = 3 of multiplicity 3. There are two 
linearly independent eigenvectors vy) = (010) y” = (0,0,1)7. Moreover, 
the latter gives a generalised eigenvector r; = (1,0,0)?. Therefore 


0 0 0 1 
y(t) =c,e** | 1 | +epe** | 0 | +c3e | t{ 0] + [ 0 
0 1 1 0 


The matrix A has a unique eigenvalue \ = 1 with multiplicity 3, and one 
eigenvector v; = (0,0,1)7. This produces two generalised eigenvectors r1 = 
(1/4,0,0)7, ro = (0, 1/20,0)7. Hence 


0 0 1/4 
y(t)=cie’ | 0} +eoe | t] OO) +] 0 
1 il 0 
re 0 1/4 0 
+c3e zo] ttt Oo | + [ 1/20 
ik 0 0 
is the general integral. 
22. Particular integrals: 
a) We have 
0 —1 0 
yt)=[{[ O J]e#?+] 0 Jt+]1 
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b) We have 
~25/18 
y(t) = | —7/18 |} e~*. 
7/144 
c) A particular integral is 
0/8 0 
y(t) = 0 cos2t+ | —1/3 ] sin2t. 
1/6 => 
23. We have 
1 ot At 2 24 At 4 _ot At 
yr (t) 3° (Ot Be y2(t) = go ge y(t) Se Tae 


24. The system reads, in normal form, 


Ps. _f 1 2 
y = Ay with A= (4 ve 


The eigenvalues of A are 1,2 = 2 +i corresponding to vi,2 = (2,14 i)’. The 
general integral is 


2cost 2sint 
ot 
y(t) =e (« io + C2 Oech) : 


Imposing the constraints gives cy = 1, co = —2. 


25. The eigenvalues of A = & 2) are Ay. = —12E 8. 


If b 4 0, they are distinct, with corresponding eigenvectors v1 = (1,1)7, v2 = 
(1,—1)", and the general integral is 


y(t) = cyeO-Le (;) 4 coe rhe & 


If b = 0 the eigenvectors stay the same, but the integral reads 


Pe nC: (;) tee &) 


In either case the initial datum gives cj = 1 and cz = 0. The required solution is, 


for any b, 
y(t) = 1 a | 
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26, For a#% 4, 6% 1, 


2a—1 2a —2 0 
yt) = ee? 1 3 = 9a-) oe | Da~ |S cge(3e-De 1); 
Le 0 0 
fora = 1, 
y(t)=cie’ | 1 | t+eoe* {1 | t+ese* {tf} 1] + { 0 . 
fora = Lo, 
es 0 —1/2 
y(t) = ce8/2* | 3/2 | 4epe/* (1 | tege/* (41) 4 (0 | 
0 0 1/2 


27. Calling y; = 2, yo = y, y3 = y’ we may write y’ = Ay where 


2 -l O 
A= 0 0 1 
-—10 5 =2 


The matrix A has eigenvalues A; = 0, Az = 3, A3 = —3 and, correspondingly, 
v; = (1,2,0)", v2 = (1,-1,—-3)7, v3 = (1,5,—15)?. 


Consequently, the general integral is 


1 1 1 
y(t)=c, [| 2] + me | =1 | pee % 5 
0 —3 —15 
1.e., 
a(t) =e, + ce™ + cze**, y(t) = 2c, — cpe** + 5e3e7**. 


28. Second-order linear equations: 
a) y(t;c1,c2) = cie* + cge7 7% + st? — St + 2 , €1,c02 ER. 
b) We solve first the homogeneous equation. The characteristic equation \? —4\+ 


4X = 0 has one double solution A = 2, so the general homogeneous integral will 
be 


Yo(t; Cis C2) = (cy + cot)e** , ¢€1,c2ER. 


Since pp = \ = 2, we look for a particular integral y,(t) = at?e?*. This gives 


by substitution, hence a = 4. Therefore y,(t) = 4t?e?* and the general integral 
2 P 2 
is 
1 
y(t; C1, C2) = (ey + egt)e*® + she" C1,€2 ER. 
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c) 
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The characteristic equation A? +1 = 0 has discriminant A = —4, so o = 0 and 
w = 1. The general integral of the homogeneous equation is 


yo(t; C1,C2) =c, cost+cysint, c1,c2ER. 


As 4 = 0 = 0, we need to find a particular integral y,(t) = t(a cost + @sint). 
By substitution, 
—2asint + 28 cost = 3cost, 


so a = 0 and 6 = 3. Therefore y,(t) = 3¢cost and the general integral is 
. 2 
y(t;c1, C2) = c, cost + cosint + gi cost, c1,c2 ER. 


y(t; C1, C2) = cye’ + cge** — te’, c1,c2 ER. 


The characteristic equation \? — 9 = 0 is solved by A = +3. Hence the general 
integral of the homogeneous ODE is 


—3t 3t 
yo(t;c1,c2) =cie ““+ce", c1,c2ER. 


The particular integral must have form y,(t) = ate~**, so substituting, 
—6ae ** = e * 


hence a = —Z. Therefore y,(t) = —Zte~* and the general integral of the 
equation is 


1 
y(t; 1,02) = ae tae" = ate c1,c2 ER. 


y(t; c1, C2) = cre? + ege®* + = cost — zsint, c1,c2 ER. 


. Cauchy problems: 


y(t) =e ‘sin 2t. 
Let us treat the homogeneous equation first. The characteristic equation A? — 
5A+4=0 gives \=1, \ = 4. Thus the homogeneous general integral is 


3 _ t At 
yo(t; C1, C2) =cje +c2e 7" C1,C2 € R. 


Substituting the particular integral y,(t) = at + 6 in the equation we find 
—5a+ 4at +468 = 2t+1, 


so a = $ and @= f. Therefore yp(t) = st + < and the general solution is 


1 7 
y(#; ¢1, C2) = ce” + coett + 3° + 8° c1,c2 ER. 
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Imposing the initial conditions gives the system 
Cy tog = 0 
Paes ; 


so Cc] = z and co = —Z. Therefore 


30. Linear ODEs of order n: 
a) The characteristic polynomial y(A) = A? + A? — 2 has a real root Ay = 1 and 


two complex-conjugate zeroes Ag, 3 = —1 +7. Then the general integral, in real 
form, is 
y(t) = cre’ + cge ‘cost + cze* sint, C1,€2,c3 ER. 
b) y(t) = cye’ + cote’ +c3, C1,€2,c3 ER. 


c) The characteristic polynomial y(\) = A*—5A°+ 7A? —5A+6 has roots Ai = 2, 
Ag = 3, A3,4 = £1. The homogeneous general integral is thus 


yo(t) = cje" + cote** + cz cost + ca sint, C1, €2,€3,c4 ER. 
Because of resonance, we search for a particular integral of type 
Y(t) = t(asint + B cost) . 
Differentiating, substituting in the ODE and comparing terms produces the 


system 
a+P=0, 
10a —106=1, 


i =1720;, 
whence 
p= -1/20. 


In conclusion, 
i| 
y = y(t) = yo(t) + at (sint — cost) 


is the general integral. 


31. The associated homogeneous equation has general integral 


yo(t) = ee + oe 5 


as one easily sees. The particular integral of the equation depends on k. In fact, 


ae*t ifk4~2,k 4-3, 
ate’ ifk=2, ork =-3 


yp(t) = { 
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with the constant a to be determined. Differentiating this expression and substi- 
tuting, 


1 
——_————. ifk 42,k 4-3 
a) HDF) ree a 
— 1 
kad ifk=2,ork=-38. 
Therefore, 
1 
2t —3t kt > 
cje~" + coe 7 + ————————e ifk~A2,k4#-8, 
iO ae 
ee" + exe + ya i k=2, ork=-3. 
32. We have 


e' (cy cos Vk t + cg sin Vkt) ifk> 0, 
y(t) = 4 e*(c1 + tea) hi a 0 
cre tVFt 4 cyell-V—t ith <0, 
with c1,c2 € R. 


33. The characteristic polynomial 


x(A) = AP — 2d? + 490 — 98 = (A — 2)(0? + 49) 


has roots A; = 2, A2,3 = £77. The homogeneous integral reads 
yo(t) = cye** + co cos 7t + cg sin TE, C1,€2,c3 ER. 


Putting b(t) = 48sint and bo(t) = (6? + 49)e*", by the principle of superposition 
we begin with a particular integral of the equation with source 6, of the form 
Yp, (t) = acost + bsint. This will give a = —1/5 and b = —2/5. 

The other term bz depends on the parameter 6. When 6 4 2, we want a 
particular integral yp, (t) = ae®’. Proceeding as usual, we obtain a = 1/(6 — 2). 
When 6 = 2, the particular integral will be y,, (t) = ate’ and in this case we find 
a= 1. So altogether, the general integral is 


‘ yo(t) + gige” — gcost — 2sint roe, 
7 yo(t) + te” — i cost — 2sint i pS. 
34. We have 


1 
Cte ae ONO 4 i ifa <0, 
27(a — 1) 


1 
cr + cat + et? — 5— sin 3t ifa=0, 


1 
c, + co cos3at + c3sin3V/at + Tea reas 1, 
a-— 


t 
GCOS STC eM On = COHN ipo 1. 
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35. The eigenvalues of A are Ay = —1, A2,.3 = — S£V1O3 so Re A < 0 for both. The 


2 
origin is thus uniformly asymptotically stable (see Proposition 10.46). 


36. We have 


If(-3,1) = & a) 


The matrix 


has eigenvalues \; = —8, Az = 2, so the origin is unstable (Corollary 10.48). 


Appendices 


A.1 


Complements on differential calculus 


In this appendix, the reader may find the proofs of various results presented in 
Chapters 5, 6, and 7. In particular, we prove Schwarz’s Theorem and we justify 
the Taylor formulas with Lagrange’s and Peano’s remainders, as well as the rules 
for differentiating integrals. At last, we prove Dini’s implicit function Theorem in 
the two dimensional case. 


A.1.1 Differentiability and Schwarz’s Theorem 


> Proof of Proposition 5.8, p. 163 


Proposition 5.8 Assume f admits continuous partial derivatives in a neigh- 


bourhood of xo. Then f is differentiable at xo. 


Proof. For simplicity we consider only the case n = 2. 

Let then x = (x,y) be a point in the neighbourhood of x = (20, yo) where the 
hypotheses hold. Call (h,k) = (a — x0, y — yo); we must prove that for (h,k) > 
(0,0), 


Flexo + hy yo +k) = Feo, 40) + SA(¢o, woh + 52 (xo, yo)k + of VEER). 


Using the first formula of the finite increment for the map x +> f(x, yo) gives 


O 
f (xo +h, yo) = f (Zo, yo) + oF (ag, yo)h-+o(h) => 0. 
At the same time, Lagrange’s Mean Value Theorem tells us that y+> f(xo +h, y) 
satisfies 


O 
f (zo + h, yo + k) = F(v0+ hyo) + A (eo +h.) 


C. Canuto, A. Tabacco: Mathematical Analysis II, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 85, DOI 10.1007/978-3-319-12757-6_A1, 
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a) 
for some 7 = 9(h,k) between yo and yo + k. Since as is continuous on the neigh- 
Yy 


bourhood of (20, yo), we have 


Ct a = (no, vo) + o(1), (x, y) > (20, Yo), 


and because (%p + h, Y) + (Xo, yo) for (h,k) > (0,0), we may write 


O (6) 
See ae = (va.t0) +o(1), (ak) + (0,0). 


In conclusion, when (h, k) — (0,0), 


f(zo + h, yo + k) = f (Xo, yo) + oH (a, yo)h + 5 (20, vo) + o(h) + o(1)k. 


But o(h) + o(1)k = o(Wh? +k?), (h,k) + (0,0). In fact, |hl, |k| < Vh? + k? 


implies 
lo(h)| lo(h)| _ | o(h) | 
es = 0, h,k) + (0,0 
ame lo(t)k|__ Jo(1)k| 
O O 
ae = jo(1)| 0, h,k) — (0,0 
oa < Ee = lol) (h,k) + (0,0) 
The claim now follows. fl 


> Proof of Schwarz’s Theorem, p. 168 


2 


CLO, 


Theorem 5.17 (Schwarz) If the mized partial derivatives and 


Of 
OL, 01; 
they coincide at xo. 


(j 4 i) exist on a neighbourhood of xo and are continuous at xo, 


Proof. For simplicity let us only consider n = 2. We have to prove 


en ee ae 
dydr 0, Yo) = dxoy Xo, Yo), 


under the assumption that the derivatives exist on a neighbourhood B,.(x, yo) of 
Lo = (Xo, yo) and are continuous at Xo. 

Let « = (a,y) € B,-(xo, yo) and set (h,k) = (@ — xo, y — yo). Consider the 
function 


gh, k) = f(@o +h, yo +k) — f(vo +h, yo) — f(@o, yo + k) + f (Xo, yo) - 
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Putting v(x) = f(x, yo + k) — f(x, yo) we see that 


g(h, k) = pao + h) — (ao) - 


The second formula of the finite increment for the function x +> v(x) gives a point 
€ = &(h,k), between xp and rp +h, for which 


g(h, k) = ny (€) = A(SE(E, yo + &) — 34 (6,)) 


x x 


Let us use the same formula again, this time to ¢(y) = L(g ,Y), to obtain a point 
n = (h,k) between yo and yo + & such that 


Of Of eee 
7, (Yo + k) — Ba 6 Yo) = ore 


(En) 


and so 92 
h,k) = hk —— : 
g(h, k) ayant) 
Now letting (h,k) — (0,0) we have (€,7) — (o,yo); therefore, since pee is 
continuous at (x9, Yo), we will have 


g(h, k) 0 f O° 


J 
ae = , Al 
I ey Te Gone > oe (A.1.1) 


But we can write g as 


g(h, k) = f(xoth, yotk)—f (xo, yotk)—f (ao+h, yo) +f (x0, yo) = b(yotk)—v(yo) , 


having put o(y) = f(o +h, y) — f(a, y). Swapping the variables and proceeding 
as before gives 


_ g(h,k) _ Of 
] = —— : A.1.2 
(h,k)2(0,0) hk Dady Po yo) ( ) 
Since the limit on the left in (A.1.1) and (A.1.2) is the same, the equality follows. 
O 


A.1.2 Taylor’s expansions 


> Proof of Theorem 5.20, p. 172 


Theorem 5.20 A function f of class C? around x9 admits at xo the Taylor 
expansion of order one with Lagrange’s remainder: 


f(a) = f(a) + V F(a) -(@ ~ a0) + 5(@ — @0) Hf ()(@ — a0), 


where & is interior to the segment Sx, xo]. 
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Proof. Set Aw = # — a = (Az;)1<i<n for simplicity. We consider the function 
of one real variable y(t) = f(a + tAa), defined around to = 0, and show it 
is differentiable twice around 0. In that case we will find its Taylor expansion of 
second order. Property 5.11 ensures that y is differentiable on some neighbourhood 
of the origin, with 


g'(t) = Vf (wo + tax). Ax =S~ Ax; of 


Ox; 
i=l 4 


(ap + tAax). 
Now set a; (t) = SL (ax +tAa) and use Property 5.11 on them, to obtain 


Of 
OL;OR; 


Of 
Ox; 


p(t) = V( ) (xo +tAax)- Ax = S- (a9 +tAx)Ag; . 


This implies y can be differentiated twice, and also that 


M 
yp (t)= >. Az; (29 + tan) Az; 
= Az - Af(ao+taAx)Az, 


as Hf is symmetric. 
The Taylor expansion of y of order two, centred at to = 0, and computed at t = 1 
reads 


1 = - 
y(1) = v(0) + y’(0) + 5¢' con 0<#<1; 


substituting the expressions of y’(0) and y’(t) found earlier, and putting © = 
xo +tAzx, proves the claim. Oo 


> Proof of Theorem 5.21, p. 172 


Theorem 5.21 A function f of class C? around x9 admits at ao the following 
Taylor expansion of order two with Peano’s remainder: 


Fae) = (eo) + Vf (wo) (w — &o) + 5(ae — a0) - Hf sro) ae — 20) 


+o(||e—2xo||?), & > ao. 


Proof. Consider the generic summand 


1 ef 
2 OxnjOR; 
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in the quadratic part on the right-hand side of (5.15). As the second derivative of 
f is continuous at xo and & belongs to the segment S{x, ao], we have 


Oy ae of 
@L—->Lo 
oy oT 
@)Ax, Ax; = Ax Ax; + nij(x) Ari An; . 
Ox, 0X; aaa Ox 5 OX; (wo) Ati; Aa; + nis(w) Axi Ax; 


We will prove the last term is in fact o(||Aa||?); for this, we recall that 0 < 
(a — b)? = a? + b? — 2ab for any pair of real numbers a,b, so that ab < $(a? + b?). 
Now note that the following inequalities hold 


1 1 
|Az;| |Ax,| < 5(|Aai|” + |Ax,|*) < 5||Aall’, 


so 
ni3 (x) Ax; Az;| iL 
0 < =|n;; 
and then high 
.. mye) Ag Ae; 
] =). 
eo [Aa 


In summary, 


s(x — xo) - Hs()(@ — @o) = s(x — xo) - H(xo)(x — x0) 


+0(||a@ — xo||7) , Lr Xo, 


whence the result. Oo 


A.1.3 Differentiating functions defined by integrals 


First, we state the following result, of great importance per se, that will be used 
below. 


Theorem A.1.1 (Heine-Cantor) Let f :dom f C R” > R” be a continu- 


ous map on a compact set 92 C dom f. Then f is uniformly continuous on 
G2: 


Proof. The proof is similar to what we saw in the one dimensional case, using 
the multidimensional version of the Bolzano- Weierstrass Theorem (see Vol. I, Ap- 
pendix A.3). 
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&> Proof of Theorem 6.17, p. 215 


Proposition 6.17 The function f defined by (6.14) is continuous on I. 


Moreover, if g admits continuous partial derivative a on R, then f is of 


class C! on I and 


b 
rie) = [| Pew ay. 


Proof. Suppose 9 is an interior point of I (the proof can be easily adapted to the 
case where Zp is an end-point). Then there is aa > 0 such that [zo 0, xo +0] C J; 
the rectangle E = [x9 — 0,%9 +0] x J isa compact set in R?. 

Let us begin by proving the continuity of f at x. As we assumed g continu- 
ous on R, hence on the compact subset EL, Heine-Cantor’s Theorem implies g 
is uniformly continuous on FE. Hence, for any ¢ > 0 there is a 6 > O such that 
\g(a@1) — g(a2)| < € for any pair of points x; = (21,41), 2 = (2, y2) in EF with 
|| — a|| < 6. We may assume 6 < o. Let now x € [x9 — 0, 9 + 0] be such that 
|x — xo| < 6; for any given y in [a,b], then, 


I(x, y) = (xo, y)|| = |z — xo| = é, 


so |g(x, y) — g(%0, y)| < €. Therefore 


b b 
|f (x) —f (o)| = [| (g(, y)—g(x0.4)) ay] < | |9(x, y)—9(a0,y)| dy < e(b—a), 


proving continuity. 


O 
As for differentiability at x29, in case = exists and is continuous on I x J, we 
x 


observe 
May) - | (a(v.9) ~ (00.9) au = f Hey ND ay, 


Given y € [a,b], we can use The Mean Value Theorem (in dimension 1) on 7 + 
g(x,y). This gives a point € between x and xo for which 


g(x,y) — g(to,y) _ Og 
gammy ae 


Og . ; ; 
By assumption, oe is uniformly continuous on FE (again by Heine-Cantor’s The- 
ze 
orem). Consequently, for any « > 0 there is ad > 0 (6 < o) such that, if 71,2 € E 
0 a) 
with ||a1—a2|| < 6, we have | (wi) (we)| < e.In particular, when x1 = (€, y) 


and #2 = (Xo, y) with ||a | — x9|| = |€ — x0| < 6, we have 
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og 


O 
Fa (o y) ~~ = (20,9)| <€. 


Therefore 


ZL — XO 


= xr b 
ae ae a _ | “8 (9,4) dy 


[ (s& y) ~ (20,4) _ 89, 


XL — Xo Ox 


[ (she y) — 200.0) ) dy 


which proves differentiability at x9 and also the formula 


f'(o) = 


> Proof of Theorem 6.18, p. 215 


Proposition 6.18 Jf a and 8 are continuous on I, the map f defined by (6.15) 


O 
is continuous on I. If moreover g admits continuous partial derivative = on 
xv 
R anda, B are C! on I, then f is C! on I, and 


B(x) 
oe i) Le yb oun) =a ene). 
a(x) Ox 


Proof. The only thing to prove is the continuity of f, because the rest is shown 
on p. 215. 

As in the previous argument, we may fix x9 € J and assume it is an interior 
point. Call E = [%9-—0, x9 +0|x J C R, the set on which g is uniformly continuous. 
Let now ¢€ > 0; since g is uniformly continuous on E and by the continuity of the 
maps a and £ on J, there is a number 6 > 0 (with 6 < o) such that |x — xo| < 6 
implies 


lg(z, y) — g(xo, y)| <e, for allye J, 
and 
lja(z)-aleg)|<e, |B) —Bleo)|<e. 
Then, setting M = max _|g(z,y)| gives 
(z,y)EE 
B(x) B(x0) 
Lf(c) ~ F(0o)| = f ) Semey- / sy Sezer a 
a(x a(xo 
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a(x0) B(xo) 
=], atev)ay + f g(x,y) dy + 
a(x a(xo 


B(«) B(x0) 
+ / Jena / sao) ay 
B(xo) a(x0) 


a(x) B(x) 
=i) (ayy + [ g(x,y) dy + 


(xo) 
< Mja(x) — a(zo)| + M|B(x) — B(zo)| + €|B(xo) — a(zo)| 
< (2M + |B(xo) — a(xo)|)e 


and f’s continuity at xo follows immediately. Oo 


A.1.4 The Implicit Function Theorem 


> Proof of Theorem 7.1, p. 263 


Teorema 7.1 Let 2 be a non-empty open set in R? and f : Q— RaC! map. 


Assume at the point (x0, yo) € 2 we have f(xo, yo) = 0. If 5p (20,0) =O; 


there exists a neighbourhood I of xo and a function py: I + R such that: 
i) (xp (x)) EQ for anyx€ I; 

wt) yo = Y(Xo); 

iit) f (x,y @) = 0 for anyx ET; 

iv) p is aC* map on I with derivative 


(A.1.3) 


On a neighbourhood of (x0, yo) moreover, the zero set of f coincides with the 
graph of vp. 


Proof. Since the map fy = ae is continuous and non-zero at (20, Yo), the local 
invariance of the function’s sign (§ 4.5.1) guarantees there exists a neighbourhood 
AC 2 of (xo, yo) where f, # 0 has constant sign. On a such neighbourhood the 
auxiliary map g(x,y) = —f2(x,y)/fy(a, y) is well defined and continuous. 
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Consider then the Cauchy problem (Section 10.4) 
{y' = g(x, y)y(@0) = yo- 


This admits, by Peano’s Theorem 10.10, a solution y = y(«) defined and of class 
C* on a neighbourhood I of xo such that (x,y(x)) € A for any x € J. Thus 
conditions i) and ii) hold; but then also iv) is satisfied, by definition of solution 
to the differential equation. As for iii), we define the map h(x) = f(x,y (z)): it 
satisfies, on J, 


h'(2) = fe (2, (@)) + fy (a, (2) 9’ (x) 
= fz(t,p(z)) + fy(x,y (2))9(z,9 (2)) 


= f2(¢,0(2))+f,(20@) (Fees) = 9; 


so it is constant; but as h(xo) = f(x0, yo) = 0, h is necessarily the zero map, 
wherefore 772). 
Concerning the last statement, note that if (x,y) € A with x € J, then 


Fey) = (ese (w)) +f Feensas= fi CT eaves 


pl 


as a consequence of the Fundamental Theorem of Calculus (Vol. I, Cor. 9.42) 
applied to y + f(x,y). The integral vanishes if and only if y = y(x), because 
the integrand is always different from 0 (recall the property stated in Vol. I, 
Thm. 9.33 iii)). Therefore on the neighbourhood of (xo, yo) where x € I we have 
f(x,y) =0 precisely when y = ¢(z). O 


A.2 


Complements on integral calculus 


In this appendix, we first introduce the notion of norm of a function, illustrated by 
several examples including norms of integral type. Next, we justify the Theorems 
of Gauss, Green and Stokes; for the sake of clarity, we confine our discussion to 
the case of specific, yet representative, geometries. The proof of the equivalence 
between conservative fields and irrotational fields in simply connected domains 
is the subsequent result. In the last section, we briefly outline the language of 
differential forms, and we express various properties of vector fields, discussed in 
the text, using the corresponding terminology. 


A.2.1 Norms of functions 


The norm of a function is a non-negative real number that somehow provides 
a measure of the “size” of the function. For instance, if f is a function and - 
is another function approximating it, the norm of the difference f — f gives a 
quantitative indication of the quality of the approximation: a small value of the 
norm corresponds, in a suitable sense, to a good approximation of f by means 
of f. 


Definition A.2.1 Given a family F of real functions defined on a set 2 C 
R”, that forms a vector space, we call norm a map from F to R, denoted by 
fo |f\l, that fulfills the following properties: for all f,g € F andallaecR 
one has 


i) ||f\| =O and ||f|| = 0 af and only if f = 0 (positivity); 


wt) \|af'|| = |al|| f|| (homogeneity); 
iii) || f + gl < lf Il + llg|| (triangle inequality). 


Note the analogy with the properties that define a norm over vectors in R”. 
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Remarkable examples of norms of functions are as follows. If F denotes the 
vector space of the bounded functions on the set 2, it is easily checked that 


Il fllo,e = sup | f(x)| 
ee 


is anorm, called the supremum norm or infinity norm. It has been introduced 
in Sect. 2.1 for 2 = A CR. If in addition 92 is a compact subset of R” and we 
restrict ourselves to consider the continuous functions on 2, namely f € C°(Q), 
then the supremum in the previous definition is actually a maximum, as the func- 
tion | f(a)| is continuous on (2 and Weierstrass’ Theorem 5.24 applies to it. So we 
define 


lf lleo(a) = max |f(x)| = Ilflloo,2 ; 


which is called the maximum norm in the space C°(2). 

Other commonly used norms are those of integral type, which measure the size 
of a function “in the average”. If 2 is a measurable set and F is the vector space 
of all Riemann-integrable functions on (2 (recall Theorem 8.20), we may define the 
absolute-value norm, or l-norm, as 


filo = | f(@)lde 


as well as the quadratic norm, or 2-norm, as 


Hike= ( [ yl@itde) , 


The latter norm has been introduced in Sect. 3.2 for 2 = [0,27] Cc R. The two 
integral norms defined above are instances in the family of p-norms, with real 


1<p< +o, defined as 
1/p 
like= ( [ (2)? dx ) | 


The quadratic norm is particularly important, since it is associated to a scalar 
product between integrable functions; its definition is 


oe [ f(w)9(a) der, 


and one has || f|l2,.0 = /(f, f)2,q. In general, the following definition applies. 
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Definition A.2.2 A scalar product (or inner product) in F is a map 
from F x F to R, denoted by f,g +> (f,g), that fulfills the following properties: 
for all f,9, fi, fo € F and all a,b €R one has 


i) (f,f) => 0 and (f, f) = 0 if and only if f = 0 (positivity); 
ti) (f,9) = (9, f) (symmetry); 
wit) (afi + Bf2,g) = a(fi,g) + B(f2,g) (linearity). 


It is easily checked that the quantity || f|| = (f, f)!/? is a norm, called the norm 
associated with the scalar product under consideration. It satisfies the Cauchy- 
Schwarz inequality 


MAGDISIFIMgI, VigeF. 


A scalar product allows us to define the concept of orthogonality between 
functions: two functions f,g € F are called orthogonal if (f,g) = 0. In a vector 
space endowed with a scalar product, the Theorem of Pythagoras holds; it is 
expressed by the relation 


If+gll’ =f? + Ill?  ifand only if = (f,g) =0. 


Indeed, one has 


Iftol? =(f+9.f +9) 
=(f,/A+ha9+(¢f)+(9,.9) 
= |f\/’ +2(f,9) + lll? , 


whence the equivalence. 


Going back to norms, for a differentiable function it may be useful to measure 
the size of its derivatives, in addition to that of the function. For instance, if Q 
is a bounded open set in R”, consider the vector space C!(Q) of the functions of 
class C' on the compact set Q (see Sect. 5.4); then, it is natural to define therein 
the norm 


IF llega = WF lleo@ + So [Dai flleogy - 
i=1 


If the maximum norms in this definition are replaced by norms of integral type 
(such as the quadratic norms of f and its first-order partial derivatives D,, f), we 
obtain the so-called Sobolev norms. 
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A.2.2 The Theorems of Gauss, Green, and Stokes 


> Proof of Proposition 9.30, p. 391 


Proposition 9.30 Let the open set Q C R® be G-admissible, and assume 
O 
f €C°(2Q2) with at 2OND) 76 La at. Wen 


| a dadydz = fni do, 
Q OX; aQ 


where n; is the ith component of the outward normal to O92. 


Proof. Without loss of generality take 7 = 3. As claimed, we shall prove the 
statement only for open, piecewise-regular sets that are normal for 73 = z. We 
begin by assuming further that 2 is regular and normal for z, and use the notation 
introduced in Example 9.27 ii). 

Integrating along segments and then by parts we obtain 


B(x,y) a 
Lx Lu, 2 Jdzdydz= ff 7 (Ee z) dz da dy 
ane) (A.2.1) 


= I f(a, y,B (x, y)) da dy — i. f(x,y, (x, y)) dx dy. 


AS Nap, = 1/||vg||, recalling (9.12) and (9.11), we have 


[ flex8 em) dedy =f F(e.78(@9))m5, (ea) leale. sll dedy 


D 


=i fnzdo. 
ar 


Likewise, nz,,, = —1/||val|, hence 


= i. f(2,y,0(a,y)) dae dy = i f (x,y,0 (#9) nayx, (€9)||Ya(a, y)|| dae dy 


= | fnzdo. 
Do 


Eventually, from n,, =, = 0 follows 


; jrede =: 
ae 
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We conclude that 


| ineeo= | frdo+ | fr.do+ f fnzdo 
(ohe) dig Da ae 


and the assertion follows from (A.2.1). 

Now let us suppose the open set is piecewise regular and normal for z, as in 
Example 9.27 iii). Call {Q,}4=1,....« a partition of 2 into regular, normal sets for 
z. Using the above result on each (2,, we have 


O 
[Z OF a dy dz = 3 fn™ do. (A.2.2) 
QO IQn 
If F = 02Q 1 OQ»y, denotes the intersection of two partition elements (intersection 
which we assume bigger than a point), then nin = —nip and 


[fn ao+ f fn ® ao =0. 
r 4 


In other terms, the integrals over the parts of boundary of each QQ, that are 
contained in (2 cancel out in pairs; what remains on the right-hand side of (A.2.2) 
is 

K 


ey fr ao = | fnzdo, 
a] J ORLNIL an 


proving the claim. 0 


> Proof of Green’s Theorem, p. 394 


Theorem 9.35 (Green) Let 2 C R? be a G-admissible open set whose bound- 
ary OQ is positively oriented. Take a vector field f = fii + faz in (CX) 
Then 


ne - He) eeu 


Proof. From (6.6) we know that curl f = (curl® )3, ® being the three-dimensional 
vector field f + 0k (constant in z). Setting Q = 2 x (0,1) as in the proof of The- 
orem 9.32, 


1 
| ES ae dedy = [ curl fdzdy = | | (carte )3 dx dy dz 
Oo <O8 Oy Q 0 JQ 


= | (une )3 da dydz. 
Q 
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Let us then apply Theorem 9.34 to the field ® € (C \@))*, and consider the third 
component of equation (9.21), giving 


[ (carte jade dyads = [ (NA ®)3do, 
Q oQ 


with N being the unit normal outgoing from OQ. It is immediate to check (N A 
®)3 = fon — fing = f -t on 0 x (0,1), whereas (N A ®)3 = 0 on 2 x {0} and 
§2 x {1}. Therefore 


1 
[ woand= fof waasarayae = fitdy= 4 7 *F; 
OQ 0) 02 O22 OQ 


and the result follows. 


> Proof of Stokes’ Theorem, p. 396 


Theorem 9.37 (Stokes) Let © Cc R® be an S-admissible compact surface 
oriented by the unit normal n; correspondingly, let the boundary OS’ be oriented 
positively. Suppose the vector field f , defined on an open set A C R® containing 
3), is such that f € (c1(A))”. Then 


[rowin-n- 


In other words, the flux of the curl of f across the surface equals the path 
integral of f along the surface’s (closed) boundary. 


Proof. We start with the case in which » is the surface (9.15) from Example 9.29, 
whose notations we retain; additionally, let us assume the function y belongs to 
C?(R). Where possible, partial derivatives will be denoted using subscripts 2, y, 2, 
and likewise for the components of normal and tangent vectors. Recalling (9.13) 
and the expression (9.16) for the unit normal of »’, we have 


| (enlf~jen= | Ce ee eee een ee ee 
aS R 
= Ge ae, ere, cia, 


where the derivatives of the components of f are taken on (a, yy (2, y)) as (x, y) 
varies in R. By the chain rule f;,, + fi,zYy is the partial derivative in y of the 
map f} (2, yy (2, y)); similarly, fo. + 3,22 represents the partial x-derivative of 
the map fo y, yy (a, y)). Moreover, adding and subtracting f3,.~xpy + fags, to 
formula f3,.2y — f3,yPx, we see that this expression equals 
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a (Js (x,y. (x, y)) Gy(@, )) 7 * (Js (2, y9 (2,4) Pale, ») 


Therefore we can use the two-dimensinal analogue of Proposition 9.30 to obtain 


| (curl f)-n= | ( — fing + fone + fa(Gynz — PuNy)) dy, (A.2.3) 
Z aR 
where nz, Ny are the components of the outgoing unit normal of O?R. 

If y : I > R? denotes a positive parametrisation of the boundary of R, then 
ny = —Y;/|\7'|| and nz = 74/|l|7’||. Furthermore the arc n : I > R® given by n(t) = 
(y(t), y2(t), p(11(t), y2(£))) is a positive parametrisation of OX’ with respect to 
the chosen orientation of 1’. The corresponding tangen vector is given by (9.17). 
Overall then, recalling the definitions of integral along a curve and line integral, 


; (— finy + fone + fa(Gynz — Pany)) dy = io + fong + fang) dt 
OR I 
= tee 
ax 


Equation (9.24) in the present case follows from (A.2.3). 
Let us now suppose » is made by K faces ),..., 7%, each as above. Using 
the result just found on every ¥,, and summing over k, we find 


K 


[ (cant f):n= ye f-7r™., (A.2.4) 


k=1 7 OX 


If F = 0,0, is the intersection of two faces (suppose not a point), the unit 
tangents to OX), and OX, satisfy ve = -t)?, hence 


fers [t-r =o. 
T r 


That is, the integrals over the boundary parts common to two faces cancel out; 
the right-hand side of (A.2.2) thus reduces to 


K 


8 A ae =f tm 


k=1 


proving the assertion. o 


> Proof of Theorem 9.45, p. 403 


Theorem 9.45 Let 2 CR”, with n = 2 or 3, be open and simply connected. 


A vector field f of class C'! on Q is conservative if and only if it is curl-free. 
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Proof. We have shown, in Proposition 9.43, the arrow f conservative > f curl- 
free. Let us deal with the opposite implication. 

First, though, we handle the two-dimensional case, and prove condition iii) of 
Theorem 9.42. Suppose + is a simple closed (i.e., Jordan) arc, (piecewise) regular 
and with trace I’ contained in §2; its interior »’ is all contained (2. Hence we can 
use Green’s Theorem 9.35 and conclude 


[ter=+ [cart fardy =o 
Y Dy 


(the sign on the second integral is determined by the orientation of I”). Should the 
arc be not simple, we can decompose it in closed simple arcs to which the result 
applies. 

Now let us consider the three-dimensional picture, for which we discuss only 
the case of star-shaped sets; build explicitly the potential f, in analogy to—and 
using the notation of-the proof of Theorem 9.42. Precisely, if Po is the point for 
which 2 is star-shaped, we define the potential at P of coordinates x by setting 


ee)=f fer 
[Po.P] 


where I{p,,p) is the segment joining Po and P. We claim grad y = f, and will 
prove it for the first component only. So let P+ AP = 2+ Ax,e;, € 2 bea 
nearby point to P, jp, pap) the segment joining Po to P+ AP, and Ip py apy 
the (horizontal) segment from P to P + AP. We wish to prove 


| fer- | fra f-T. (A.2.5) 
Ip ,P+AP] IP ,P] Ip, p+aAp] 


But this is straightforward if Po, P and P+ AP are collinear. If not, they form a 
triangle 3’, which is entirely contained in 92, the latter being star-shaped. Calling I" 
the boundary of +’ oriented from Po to P along I|p,,p}, we invoke Stokes’ Theorem 


and have 
[fre [euntan=o. 
- Fr 


| fers | fer- | f-7T=0, 
Ip9,P] I[p),P+AP] Ip, p+ap] 
whence (A.2.5). Therefore 


v(x + Arie) — v(x) 1 i pup 
Ip, p+ap] 


1.e., 


Ax, ~ Ax, 


iL 


Ax, 
= red fi(a + tei) dt, 


and we conclude as in Theorem 9.42. Oo 
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A.2.3 Differential forms 


In this section, we introduce a few essential notions about differential forms; 
through them, it is possible to reformulate various definitions and relevant prop- 
erties of vector fields, that we encountered in previous chapters. The forthcoming 
exposition is deliberately informal and far from being complete; our goal indeed is 
just to establish a relation between certain notations adopted in this textbook and 
the language of differential forms, which is commonly used in various applications. 

Let us assume to be in dimension 3; as usual, the reduction to dimension 
2 is straightforward, while several concepts may actually be formulated in any 
dimension n. So, in the sequel 2 will be an open set in R°. 


A differential 0-form F is, in our notation, a real-valued function (also called 
a scalar field) p = v(x, y, z) defined in 9, i-e., 


=o. 


A differential 1-form w is an expression like 


=F dr) dyn dz, 


where P = P(z,y,z), Q = Q(z, y,z) and R = R(z,y,z) are scalar fields defined 
in §2. In our notation, it corresponds to the vector field 


f=Pi+Qj+Rk. 


Thus, the symbols dz, dy and dz denote particular 1-forms, that span all the oth- 
ers by linear combinations. They correspond to the vectors 2, 7 and k, respectively, 
of the canonical basis in R°. 

An expression like 


fe= fe drz+Qdy+Rdz), 


where J’ is an arc contained in §2, corresponds in our notation to the path integral 


[fer 


Formally, the relationship may be motivated by multiplying and dividing by dt 
under the integral sign, 


daz dy dz 
=) (P40 a) a 
ic [( arr a) 


and thinking of y(t) = (a(t), y(t), z(t)) as the parametrization of the arc I’ (re- 
call (9.8) and (9.10)). 
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The derivative of a 0-form F is defined as the 1-form 


obviously assuming the scalar field y differentiable in 2. In our notation, this 
relation is expressed as 
f =grady. 


A differential 1-form w is called exact if there exists a 0-form F’ such that 


In our notation, this is equivalent to the property that the vector field f associated 
with the form w is conservative. 


It is possible to define the derivative of a 1-form w = P dr+Q dy+ R dz as 
the 2-form 


Oz Ox 


Oy Ox Oy 


If we identify the symbols dz, dy and dz with the vectors of the canonical basis in 
R° and we formally interpret the symbol ( as the external product of two vectors 
(recall (4.5)), then we have dy Adz = da, dz Adz = dy and dz Ady = dz; this 
shows that, with respect to our notation, the differential form dw is associated 
with the vector field 

g = curl f 


(recall (6.4)). In general, for a differential 2-form 


W = SdyAdz + TdzAdz +UdxAdy , 


where S = S(az,y,z),T = T(xz,y,z) and U = U(az,y,z) are scalar fields, it is 
possible to define the integral 

| vy 

z 


over a surface »/ contained in 2; it corresponds, in our notation, to the flux integral 


es 
py 


(recall (9.14)), where g = Si + Tj + Uk is the vector field associated with the 
form W. In particular, we have 


[w= few s)-n. 
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Hence, in the language of differential forms, Stokes’ Theorem 9.37 takes the elegant 
expression 


A differential 1-form w is called closed if 


in our notation, this corresponds to the property that the vector field f associated 
with the form is irrotational, i.e., it satisfies 


curl f =O. 
One has the property 
Jef = GIG) 0 
that in our notation is equivalent to the identity 
curl grad » = 0 


(recall Proposition 6.7). Such a property may be formulated as 


an exact differential 1-form is closed 


that corresponds to state that a conservative vector field is irrotational (see Prop- 
erty 9.43). If the domain (2 is simply connected, then we have the equivalence 


a differential 1-form is exact if and only if it is closed 


that translates into the language of differential forms our Theorem 9.45, accord- 
ing to which in such a domain a vector field is conservative if and only if it 1s 
irrotational. 


Basic definitions and formulas 


Sequences and series 


Geometric sequence (p. 3): 
0 


il 
Liat ag = 
n—- oo +00 


does not exist 


The number e (p. 3): 


1 n 
6= in (a+=) 
n 


N—- CoO 
Geometric series (p. 6): 


converges to 7 


diverges to +00 


is indeterminate 


Mengoli’s series (p. 7): 
ee 


(k-)k 


Generalised harmonic series (p. 15): 


ca converges if a> 1, 
El 


k=1 


diverges ifa < 1 
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Definitions and formulas 


Power series 


Convergence radius (p. 48): 


CO 
R=sup {x ER: Ss" apx* converges} 
k=0 


Ratio Test (p. 49): 
if = +00, 


few Jo eg eee if £2=0, 
k-r0o| Ak 
if0 << +00 


Root Test (p. 50): 
if = +00, 


jim 4/|ax| = 2 i f=0, 
+00 
1/é if0<£<+600 


Power series for analytic functions (p. 56): 


2 ek) (a 
Fle) = So Pee — a0) 


Special power series (p. 54 and 58): 


Definitions and formulas 


Fourier series 


Fourier coefficients of a map f (p. 82): 
1 27 


= Oe J 


f(x) dx 


ao 


27 
f(x) coska dz, 


27 
f(x)sinka dz , 


1 Qn 


—ikax 
= d kez 
x, | Swe x, € 


Ck 


Fourier series of a map f € Con (p. 85): 


rat (a, cos ka + by sin kx) & 3 ee" 
k=1 k=—o0o 
Parseval’s formula (p. 91): 


+00 


QT +oo 
| f(a)? dx = 2na2 + 0 (a2 +02) = 20 S> eal? 
0 


k=1 k=—0o 


Square wave (p. 85): 
-—1 if-cr<2<0, ee 
: 4 
O° dae (), Ser. ap a= z sin@2m + 1)x 
. 7 — 
1 if0<a<q, mH 


Rectified wave (p. 87): 


f(x) = |sina|, 
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Real-valued functions 


Partial derivative (p. 157): 
f(xo + Az e;) — f(@o) 


Gradient (p. 157): 

Vi (@o) = grad f(@o) = (2 (a0)) 
Differential (p. 162): 

far) (Av) = Vf (x0) - Aa 


Directional derivative (p. 163): 


of 
Ov 


Of 


O 
I Po) Ui a 


aa 


(20) = Vf (ao) + v 


(0) Un 


Second partial derivative (p. 168): 


af _ Oo (af 
ONGOR, (x0) 7 On; (55) (x0) 


Hessian matrix (p. 169): 
Omi 


J 7 


Taylor expansion with Peano’s remainder (p. 172): 
if 
Fle) = (eo) + VF{#0) - (@ — 20) + 5(@ — #0): Hf(@0)(@ — &0) + olla — ao|\”) 


Curl in dimension 2 (p. 206): 
or, at, 


lje— 
ss I Ox2 Ox, 


Fundamental identity (p. 209): 
curl grad f =V A (Vf) =0 


Laplace operator (p. 211): 


n 2 
Af = divgrad f=V-vf= So o4 
j=1 9 
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Vector-valued functions 


Jacobian matrix (p. 


O 
Jf (xo) = (52 - 


Differential (p. 203): 


dfz,(Axv) = Jf (ao) Ax 
Divergence (p. 205): 


Curl in dimension 3 (p. 205): 
Oa (Ofo\«,. f Om, Of8 24, | Ole - On 
— — = ae a ES oe k 
_ f ae f (52 52) a (54 Ox J - Or1 0x2 
Curl in dimension 2 (p. 205): 


af _ ah 
Or1 Ox 
Fundamental identity (p. 209): 


divcurl f =V-(VA f) =0 


curl f= 


Derivative of a composite map — Chain rule (p. 212): 
J(9° f)(#0) = Jg(yo) Jf (xo) 

Tangent line to a curve (p. 217): 

T(t) = y(to) + 7'to)(E-to), teR 

Length of a curve (p. 222): 


b 
=f Welle 


Tangent plane to a surface (p. 237): 
Oo Oo 
IT (u,v) = o (uo, vo) + By (uo Vo) (Uw — Uo) + Fy (uo v9) (v — vo) 


Normal vector of a surface (p. 238): 


O O 
v(uo, v0) = 5, (wo. vo) Ae Cie.) 


Ov 
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Polar coordinates 


From polar to Cartesian coordinates (p. 230): 


® : [0,-+oo) x R> R?, (r,0) + (x,y) = (rcos@,rsin 6) 


Jacobian matrix and determinant (p. 230): 


J®(r0) = ( 


6 —rsiné 
a ) det J@(r,0) =r 


sin@ rcosdé 


Partial derivatives in polar coordinates (p. 231): 


Of _ Of Of Of Of, Of 
Be ae cos @ + Dy sin 6, 70 = Ay? sine + ad 


OF 0) es dg sin @ OF 08 7g (OF Cont 


Ox Or OO or dy Or 06 or 
Variable change in double integrals (p. 320): 


[ fenaray= | f(rcos6,rsin 6) r dr dé 
Q Q 


Cylindrical coordinates 


From cylindrical to Cartesian coordinates (p. 233): 


® : [0, too) x R? > R®, (r,0,t) + (a, y, z) = (rcos6,rsin 8, t) 


Jacobian matrix and determinant (p. 233): 


cos? —rsinéd 0 
J®(r,6,t)= | sn@O rcosd O], det J@(r,0,t) =r 
0 0 1 


Partial derivatives in cylindrical coordinates (p. 233): 


ini Of Of __ Of Of Of _ Of 
eee aa rag a 70 Ay? ne + ae sO, Oz 


Of _ Of Of sind Of of Of cosé Of of 


ac Or Oe * Oy on Ok e ° aa BF 


Variable change in triple integrals (p. 328): 


| feuaarayae= | f(rcos0,r sin 6, t) r dr dé dt 
Q Q 
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Spherical coordinates 


From spherical to Cartesian coordinates (p. 234): 


® : (0, +00) x R? > RS, 


(r,~,0) + (x,y,z) = (rsinycos6,r sin ysin 9, r cos y) 


Jacobian matrix (p. 234): 
sinycos@ rcosycos@ —rsinysin#d 
J®(r,0,0) = sinysind rcosysin@  rsinwcosé 
cos —rsiny 0 


Jacobian determinant (p. 234): 
det J@®(r,p,0) =r? sing 


Partial derivatives in spherical coordinates (p. 233): 


af af. af...) af 
on oe sin y cos 6 + Dy sin ysin 8 + Ae C08 P 


of 
Op 


ts = —2Fr sin g sind + sor sing cosé 


cos y cos 8 + of cos y sin 0 Of si 
—-r —-r ind — —rsin 
Ox . Oy . Oz . 


Of Of . Of cospcosé Of sind 
an a aA rs 5p aint 
Of of . . Of cosysin@d Of cosé 
Oy Oe oe ir 00 rsiny 
Of 

a 


Variable change in triple integrals (p. 329): 


[ie y,z)dadydz= | f(rsinycos6,rsinysin9@,r cosy) r? sin y dr dy dé 
Q Q! 
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Multiple integrals 


Vertical integration (p. 309): 


[= [ (LC) rena) 


Horizontal integration (p. 309): 


[if -{ (oa f(x,y) es) dy 


Iterated integral (p. 324): 


g2(x, a 
ie =f 7 f(x,y, z)dz | da dy 
gi (x,y) 


Iterated integral (p. 325): 


fe-ff (, te.2)eay) de 


Variable change in multiple integrals (p. 328): 
| f(a) dQ =), f (®(u)) | det J&(u)| da 
Q Q 


Pappus’ Centroid Theorem (p. 333): 
vol(2) = 27yg area(T) 


Definitions and formulas 


Integrals on curves and surfaces 


Integral along a curve (p. 368): 
b 
[t= f fo@) role 
Y a 
Path integral (p. 375): 


[are [i= [ s00)-Y0a 


Integral on a surface (p. 378): 
[t= [ flotur)o(u,e dude 


Flux integral (p. 384): 


[temm f tom ff F(o(u0)) run) dude 


Divergence Theorem (p. 391): 


[wt [ten 


Green’s Theorem (p. 394): 


[G-DPoww=$ po 


Stokes’ Theorem (p. 396): 


[tempn=$ ter 
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Examples of quadrics 


Ellipsoid 


Hyperbolic paraboloid 
z_ vy 


we 


Elliptic paraboloid 


Definitions and formulas 


One-sheeted hyperboloid 


Two-sheeted hyperboloid 
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Index 


Arc, 139 
closed, 139 
end points, 139 
Jordan, 139 
length, 222-224 
meridian, 144, 332 
simple, 139 

Area, 304, 368, 378, 381 


Binormal, 226 
Boundary, 243, 247 
of a set, 120 


Canonical basis, 112 
Centre 

of a power series, 45 

of curvature, 226 

of gravity, 330 

of mass, 330, 374, 383 
Centroid, 330, 383 
Chain rule, 213 
Change 

of variable, 227, 317, 319 
Closure 

of a set, 120 
Coefficients 

Fourier, 85 

of a power series, 45 
Component, 247 

connected, 125 
Constraint, 277 
Convergence 

absolute, 41 


pointwise, 34, 41, 93, 94 
quadratic, 90, 91 
uniform, 35, 42, 95 
Coordinates 
curvilinear, 228 
cylindrical, 233, 328 
polar, 230, 318 
spherical, 234, 329 
Cross product, 113 
Curl, 205, 393 
Curvature, 226 
Curve, 138 
anti-equivalent, 220 
congruent, 220 
differentiable, 217 
equivalent, 220 
homotopic, 402 
integral, 426 
length, 222 
level, 268, 269 
opposite, 221, 371 
piecewise regular, 217 
plane, 138 
regular, 217, 222 
simple, 139, 222 
Cylindroid 
of a function, 298, 311 


Derivative 

directional, 159 

partial, 156, 159, 168, 171 
Determinant, 115 
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Differential, 162, 203 
Differential equation 
autonomous, 427 
Bernoulli, 437 
homogeneous, 432, 434, 474 
linear, 433 
non-homogeneous, 434, 476 
ordinary, 424 
Riccati, 437 
solution, 424, 426 
with separable variables, 430 
Divergence, 205, 391 
Dot product, 112 


Edge, 247 
Eigenvalue, 116 
Eigenvector, 116 
generalised, 466 
Ellipsoid, 144 
Error function, 58 
Estremum 
constrained, 274, 276 
unconstrained, 174 
Existence 
global, 446 
local, 440 
Exterior 
of a set, 120 


Face, 247 

Field, 128 
conservative, 209, 397 
curl-free, 209 
divergence-free, 209 


Function 


analytic, 56 

Bessel, 63 

continuous, 130 

defined by integral, 214, 516 
differentiable, 161, 203 
error, 58 

expansion in Taylor series, 56 
generically continuous, 307 
gradient, 158 

harmonic, 212 

implicit, 261, 263, 265, 266, 518 
integrable, 300, 307, 322, 335 
Lagrangian, 280 

limit, 34 

Lipschitz, 166, 203, 441 
locally Lipschitz, 447 
Lyapunov, 451 

of class C*, 171 

of several variables, 126 
orthogonal, 523 

partial derivative, 158 
periodic, 76 

piecewise C!, 95 

piecewise continuous, 79 
piecewise monotone, 93 
piecewise regular, 93 
rectified wave, 87, 95, 96 
regularised, 79 

sawtooth wave, 92 

scalar, 126 

square wave, 85, 95, 96 
vector-valued, 128, 201 


irrotational, 209, 403, 527 
of curl type, 209 

radial, 406 

vector, 158 


Flux, 383, 384 
Form 


differential, 529 
closed, 531 
exact, 530 

normal, 424 

quadratic, 118, 176 


Formulas 


Frenet, 227 
reduction, 309 


Frequency, 77 


Gibbs phenomenon, 95 
Gradient, 157, 205 


Helicoid, 144 

Helmholtz decomposition, 211 
Hessian of a function, 169 
Homotopy, 402 


Inequality 
Cauchy-Schwarz, 80, 112, 523 
triangle, 112 

Integral 
along a curve, 368, 373 
circulation, 376 
double, 297, 298, 304 


first, 451 
flux, 383, 384 
general, 427 
improper, 335 
iterated, 310, 324, 325 
lower, 300 
multiple, 297, 322, 335 
on a surface, 378, 380 
particular, 459, 470 
singular, 430 
triple, 297, 322 
upper, 300 
Integration 
reduction formula, 303, 310, 324, 325 
Interior 
of a set, 120 


Jacobian, 202 
Klein bottle, 247 


Lagrange multipliers, 278 
Lagrangian, 280 
Laplacian, 211 
Length, 222 
Level set, 127 
Limit, 132, 136 
Line 
coordinate, 228 
integral, 375, 377 
Linearisation, 455 


M-test 

Weierstrass, 43 
Mobius strip, 242, 247 
Mass, 330, 374 
centre of, 330 
Matrix, 114 

definite, 118 
determinant, 115 
diagonalisable, 116 
exponential, 466 
Hessian, 169 
identity, 115 
indefinite, 118 
inverse, 116 
Jacobian, 202 
minor, 115 
non-singular, 116 


Index 


norm, 115 
normal, 115 
orthogonal, 115 
rank, 115 
semi-definite, 118 
similar, 117 
square, 114, 115 
symmetric, 115 
transpose, 115 
Maximum, 174 
Measure, 304, 305 
Meridian section, 332 
Minimum, 174 
Minimum square error, 83 
Minor, 115 
Moment, 374 
of inertia, 331, 383 
Multiplicity 
algebraic, 116 
geometric, 116 


Neighbourhood, 120 
Norm, 521 
absolute value, 522 
Euclidean, 112, 115 
infinity, 35, 522 
maximum, 522 
quadratic, 80, 522 
sup-norm, 35 
supremum, 35, 522 
Normal, 225 
unit, 238 
Normal vector 
unit, 386 


ODE, 424 
separable, 430 
Operator 
differential, 211 
Laplace, 211 
Orbit, 427 
closed, 427 
Order, 424 
Orientation, 221, 241, 389 
Orthogonal functions, 523 


Parametrisation, 237, 240 
anti-equivalent, 240 
equivalent, 240 
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Parseval’s formula, 91, 97 of a series, 8, 14, 16 
Partition, 298 Peano, 172, 514 
Path integral, 375, 377 Resonance, 470, 476 
Phase space, 427 Riemann-Lebesgue Lemma, 91 
Plane 
osculating, 225 Saddle point, 179, 483 
tangent, 161, 238 Sawtooth, 92 
Point Sequence 
boundary, 120 convergent, 1, 34 
centre, 483 divergent, 1 
critical, 175, 479 geometric, 3 
equilibrium, 479 indeterminate, 1 
exterior, 120 numerical, 1 
extremum, 174 of functions, 34 
focus, 484 of partial sums, 4 
interior, 120 Series 
isolated, 123 absolutely convergent, 17 
limit, 123 alternating, 16 
maximum, 174 binomial, 51, 59 
minimum, 174 centre, 45 
node, 483 coefficients, 45 
regular, 175 conditionally converging, 19 
saddle, 179, 483 convergent, 5 
stationary, 175, 479 differentiation, 43, 54 
Polinomial divergent, 5 
trigonometric, 78 exponential, 45, 58 
Polygonal path, 124 Fourier, 75, 85, 96 
Polynomial general term, 4 
characteristic, 116, 461, 475 geometric, 6, 42 
Taylor, 171 harmonic, 10, 15, 17 
Positively-oriented triple, 114, 229 indeterminate, 5 
Potential, 209, 404 integration, 42, 54 
Power series, 45 Maclaurin, 56 
Problem Mengoli, 7 
Cauchy, 427 numerical, 4 
initial value, 427 of derivatives, 54 
Product of functions, 41 
Cauchy, 20 positive-term, 9 
scalar, 80, 523 power, 45 
Pseudo-derivative, 94 product, 20 
radius, 48-50 
Radius remainder, 14, 16 
of convergence, 48-50 sum, 41 
of curvature, 226 Taylor, 56 
spectral, 117 telescopic, 7 
Rank, 115 Set 
Region, 126 G-admissible, 386 
Remainder bounded, 124 


Lagrange, 172, 513 closed, 121 


compact, 124 
connected, 124 
convergence, 34, 41 
convex, 124 
level, 127, 268 
measurable, 304 
normal, 308, 323 
open, 121 
simply connected, 402 
star-shaped, 403 
Solution 
attractive, 479 
global, 427, 444 
of a differential equation, 424, 426 
stable, 479 
stationary, 479 
uniformly attractive, 480 
Stability, 478 
linearised, 488 
Sum, 41 
lower, 299 
of a series, 5 
upper, 299 
Surface, 142, 236 
S-admissible, 389 
boundary, 243, 247 
closed, 246 
compact, 381, 389 
exterior region, 247 
helicoidal, 144 
integral, 384 
interior region, 247 
level, 268, 273 
of revolution, 144, 382 
orientable, 241, 247 
piecewise regular, 247 
regular, 236, 237 
simple, 143, 237 
spherical, 144 
topographic, 144 
System 
autonomous, 480 
conservative, 450 
dissipative, 450 
fundamental, 457 
Hamiltonian, 453 
homogeneous, 456, 462, 466 
linear, 456 
non-homogeneous, 459, 470 


Index 


orthogonal, 80 
orthonormal, 81, 112 


Taylor expansion, 172, 513, 514 
Test 
absolute convergence, 18 
asymptotic comparison, 11 
comparison, 2, 9 
integral, 13 
Leibniz, 16 
ratio, 3, 12, 49 
root, 12, 50 
Weierstrass’ M-test, 43 
Theorem 
Abel, 49 
Cauchy-Lipschitz, 442 
comparison, 2 
Curl, 393 
Dini, 266, 518 
divergence, 391 
Fermat, 175 
Gauss, 391, 524 
Green, 393, 525 
Guldin, 333, 382 
Heine-Cantor, 515 
Jordan curve, 140 
Lagrange, 165 
Mean Value, 165, 314 
Pappus, 333 
Peano’s Existence, 440 
Pythagoras, 523 
Schwarz, 168, 512 
Stokes, 395, 526, 531 
substitution, 2 
Weierstrass, 174 
Torus, 147, 246, 334 
Trace, 142 
of a curve, 138 
of a surface, 142 
Trajectory, 427 
Triple 
positively oriented, 390 


Variable change, 328 
Vector, 111 
binormal, 226 
normal, 225, 238, 242 
orthogonal, 112 
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tangent, 218, 225 

torsion, 226 

unit, 112, 225 
Volume, 311, 332 


Wave 
rectified, 87, 95, 96 
square, 85, 95, 96 
Wedge product, 113 
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